To what bit string is a character (or string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????ÆB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111100011001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3fc642
SJIS-WIN 筌??而??夷∽?醫?????B 111000101010001100111111001111111000111010100111001111110011111110001000110011101000000111100100001111111110011111001110001111110011111100111111001111110011111101000010 e2a33f3f8ea73f3f88ce81e43fe7ce3f3f3f3f3f42
EUC-JP 筌??而??夷∽?醫????ÆB 1110010010100101001111110011111110111100101010010011111100111111101100001101000010100010111001100011111111101110110100000011111100111111001111110011111110001111101010011010000101000010 e4a53f3fbca93f3fb0d0a2e63feed03f3f3f3f8fa9a142
UTF-8 筌륁쥓而쎿쐣夷∽쭗醫딆퍤勵됱ÆB 111001111010110110001100111010111010010110000001111011001010010110010011111010001000000010001100111011001000111010111111111011001001000010100011111001011010010010110111111000101000100010111101111011001010110110010111111010011000011010101011111010111001010010000110111011011000110110100100111011111010010110111111111010111001000010110001110000111000011001000010 e7ad8ceba581eca593e8808cec8ebfec90a3e5a4b7e288bdecad97e986abeb9486ed8da4efa5bfeb90b1c38642
UHC 筌륁쥓而쎿쐣夷∽쭗醫딆퍤勵됱ÆB 11101111101001111000111111101100101000101000101011101100101110111001101111100110100111001000100111101100101010001010000111101111101001111000111111101100101000101000101011101100101110111001101111100101111110101000100111101100101010001010000101000010 efa78feca28aecbb9be69c89eca8a1efa78feca28aecbb9be5fa89eca8a142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
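
The conversion this page performs can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper names `to_bit_strings` and `convert` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above. Python's codec tables may differ from the page's in edge cases.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("ÆB").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("ÆB", "utf-8")` yields the hexadecimal string `c38642`, which matches the last three bytes of the UTF-8 row in the table.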