Which bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???夷??淫??搖 00111111001111110011111110001000110011100011111100111111100010001111101000111111001111111001110110001010 3f3f3f88ce3f3f88fa3f3f9d8a
EUC-JP ???夷??淫??搖 00111111001111110011111110110000110100000011111100111111101100001111110000111111001111111101100111101010 3f3f3fb0d03f3fb0fc3f3fd9ea
UTF-8 琉욁걶夷듬틯淫잙쨰搖 111011111010011110001100111011001001101010000001111010101011000110110110111001011010010010110111111010111001001110101100111011011000101110101111111001101011011110101011111011001001111010011001111011001010100010110000111001101001000010010110 efa78cec9a81eab1b6e5a4b7eb93aced8bafe6b7abec9e99eca8b0e69096
UHC 琉욁걶夷듬틯淫잙쨰搖 1110101110100100100111101110001110000001100111001110110010101000101101011110101110111010100110011110101111100010100111111110101110100100100010101110100011110100 eba49ee3819ceca8b5ebba99ebe29feba48ae8f4

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
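
A conversion like the one in the table can be approximated with Python's built-in codecs. This is a minimal sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption; the tool itself may use other converters.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Build one row per charset, keyed by the table's charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}
```

For example, `convert("夷")["UTF-8"]` returns the character itself together with the 24-bit string for the bytes e5 a4 b7, while `convert("夷")["ISO-8859-1"]` returns "?" with the byte 3f, because ISO-8859-1 has no code for that character.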