To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 烏≪?烏≪?B 1000100101000111100000011110000100111111100010010100011110000001111000010011111101000010 894781e13f894781e13f42
EUC-JP 烏≪?烏≪?B 1011000110101000101000101110001100111111101100011010100010100010111000110011111101000010 b1a8a2e33fb1a8a2e33f42
UTF-8 烏≪쬅烏≪쬅B 11100111100000111000111111100010100010011010101011101100101011001000010111100111100000111000111111100010100010011010101011101100101011001000010101000010 e7838fe289aaecac85e7838fe289aaecac8542
UHC 烏≪쬅烏≪쬅B 11101000101000011010000111101100101001101001110011101000101000011010000111101100101001101001110001000010 e8a1a1eca69ce8a1a1eca69c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)