Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	鈺??湲??宋	11111011110001000011111100111111100111111101000100111111001111111001000101110110	fbc43f3f9fd13f3f9176
EUC-JP	鈺??湲??宋	1000111111100011110101010011111100111111110111101101001100111111001111111100000111010111	8fe3d53f3fded33f3fc1d7
UTF-8	鈺쎈맢湲썽쪛宋	111010011000100010111010111011001000111010001000111010111010011110100010111001101011100110110010111011001000110110111101111011001010101010011011111001011010111010001011	e988baec8e88eba7a2e6b9b2ec8dbdecaa9be5ae8b
UHC	鈺쎈맢湲썽쪛宋	1110100010101101101111011110101110010000101011101110101010111000101111011110100110100101100101001110000111100100	e8adbdeb90aeeab8bde9a594e1e4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)