Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 中??砥????苡 100100101000011000111111001111111001001101110101001111110011111100111111001111111110010010001111 92863f3f93753f3f3f3fe48f
EUC-JP 中??砥????苡 110000111110011000111111001111111100010111010110001111110011111100111111001111111110011111101111 c3e63f3fc5d63f3f3f3fe7ef
UTF-8 中淚쓱砥렡綎흗홧苡 111001001011100010101101111011111010010110001101111011001001001110110001111001111010000010100101111010111010000010100001111001111011011010001110111011011001110110010111111011011001100110100111111010001000101110100001 e4b8adefa58dec93b1e7a0a5eba0a1e7b68eed9d97ed99a7e88ba1
UHC 中淚쓱砥렡綎흗홧苡 111100011110100111010010111001111011111010110011111100101011001010001110101100101110111111110010110010001110100111001000101100011110110010111110 f1e9d2e7beb3f2b28eb2eff2c8e9c8b1ecbe

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
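
A conversion of this kind can be approximated in a few lines of Python. This is a minimal sketch and not the code behind this page. The mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is why 3f appears repeatedly in the rows above. Python's codec tables may differ from this page's in edge cases.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("中").items():
        print(label, bits, hex_string)
```

For the single character 中, the UTF-8 row of this sketch gives e4b8ad and the ISO-8859-1 row gives 3f, matching the first bytes of the corresponding rows in the table.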