To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???肄??醫??壤???????裕?? 001111110011111100111111111000111110010100111111001111111110011111001110001111110011111110011010110111110011111100111111001111110011111100111111001111110011111110010111010101000011111100111111 3f3f3fe3e53f3fe7ce3f3f9adf3f3f3f3f3f3f3f97543f3f
EUC-JP ???肄??醫??壤??堉????裕?? 0011111100111111001111111110011011100111001111110011111111101110110100000011111100111111110101001110000100111111001111111000111110110111111111010011111100111111001111110011111111001101101101010011111100111111 3f3f3fe6e73f3feed03f3fd4e13f3f8fb7fd3f3f3f3fcdb53f3f
UTF-8 捻뚭여肄뽩쉽醫덈늿壤깆쥉堉팋捻뚭였裕덂쉽 111011111010011010100100111010111001101010101101111011001001011110101100111010001000001010000100111010111011110110101001111011001000100110111101111010011000011010101011111010111000110110001000111010111000101010111111111001011010001110100100111010101011100110000110111011001010010110001001111001011010000010001001111011011000110010001011111011111010011010100100111010111001101010101101111011001001100010000000111010001010001110010101111010111000110110000010111011001000100110111101 efa6a4eb9aadec97ace88284ebbda9ec89bde986abeb8d88eb8abfe5a3a4eab986eca589e5a089ed8c8befa6a4eb9aadec9880e8a395eb8d82ec89bd
UHC 捻뚭여肄뽩쉽醫덈늿壤깆쥉堉팋捻뚭였裕덂쉽 11100110111101111000110011101010101111111010100111101100101111011001011011100101101111011011000111101100101000101000100011101011100010001000100011100101101111011011000111101100101000101000001011101011101111001011101101001101111001101111011110001100111010101011111110110100111010111010111010001000111001011011110110110001 e6f78ceabfa9ecbd96e5bdb1eca288eb8888e5bdb1eca282ebbcbb4de6f78ceabfb4ebae88e5bdb1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)