To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???z???zB 001111110011111100111111011110100011111100111111001111110111101001000010 3f3f3f7a3f3f3f7a42
SJIS-WIN 聶х,z聶х,zB 111000111110000110000100100001111000000101000011011110101110001111100001100001001000011110000001010000110111101001000010 e3e1848781437ae3e1848781437a42
EUC-JP 聶х,z聶х,zB 111001101110001110100111111001111010000110100100011110101110011011100011101001111110011110100001101001000111101001000010 e6e3a7e7a1a47ae6e3a7e7a1a47a42
UTF-8 聶х,z聶х,zB 11101000100000011011011011010001100001011110111110111100100011000111101011101000100000011011011011010001100001011110111110111100100011000111101001000010 e881b6d185efbc8c7ae881b6d185efbc8c7a42
UHC ?х,z?х,zB 00111111101011001110011110100011101011000111101000111111101011001110011110100011101011000111101001000010 3face7a3ac7a3face7a3ac7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)