To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 中??砥?????苡 10010010100001100011111100111111100100110111010100111111001111110011111100111111001111111110010010001111 92863f3f93753f3f3f3f3fe48f
EUC-JP 中??砥?????苡 11000011111001100011111100111111110001011101011000111111001111110011111100111111001111111110011111101111 c3e63f3fc5d63f3f3f3f3fe7ef
UTF-8 中淚쓱砥렡綎흖렭렲苡 111001001011100010101101111011111010010110001101111011001001001110110001111001111010000010100101111010111010000010100001111001111011011010001110111011011001110110010110111010111010000010101101111010111010000010110010111010001000101110100001 e4b8adefa58dec93b1e7a0a5eba0a1e7b68eed9d96eba0adeba0b2e88ba1
UHC 中淚쓱砥렡綎흖렭렲苡 1111000111101001110100101110011110111110101100111111001010110010100011101011001011101111111100101100100011101000100011101011101010001110101111111110110010111110 f1e9d2e7beb3f2b28eb2eff2c8e88eba8ebfecbe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)