Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???z???zB 001111110011111100111111011110100011111100111111001111110111101001000010 3f3f3f7a3f3f3f7a42
SJIS-WIN 辯呈現z辯呈現zB 111001111000011110010010111001101000110010111011011110101110011110000111100100101110011010001100101110110111101001000010 e78792e68cbb7ae78792e68cbb7a42
EUC-JP 辯呈現z辯呈現zB 111011011110011111000100111010001011100010111101011110101110110111100111110001001110100010111000101111010111101001000010 ede7c4e8b8bd7aede7c4e8b8bd7a42
UTF-8 辯呈現z辯呈現zB 111010001011111010101111111001011001000110001000111001111000111110111110011110101110100010111110101011111110010110010001100010001110011110001111101111100111101001000010 e8beafe59188e78fbe7ae8beafe59188e78fbe7a42
UHC 辯呈現z辯呈現zB 110111001010101011101111110100001111101011011110011110101101110010101010111011111101000011111010110111100111101001000010 dcaaefd0fade7adcaaefd0fade7a42

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
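
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own code: the mapping from the table's charset labels to Python codec names (in particular "cp932" for SJIS-WIN and "cp949" for UHC) is an assumption, as are the names CODECS and encode_row. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" instead of raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "辯呈現z辯呈現zB"
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

Each output byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.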