Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	紆紺頗?轤	111000101111110010001101101011101001000010011100001111111110011110000010	e2fc8dae909c3fe782
EUC-JP	紆紺頗繐轤	1110010011111110101110101011000010111111111111001000111111010100110100111110110111100010	e4febab0bffc8fd4d3ede2
UTF-8	紆紺頗繐轤	111001111011010010000110111001111011010010111010111010011010000010010111111001111011100110010000111010001011110110100100	e7b486e7b4bae9a097e7b990e8bda4
UHC	紆紺頗??	1110100111100001110010101111101011110111111101100011111100111111	e9e1cafaf7f63f3f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
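
The table above can be approximated with a short Python sketch. It encodes the same text with each charset and prints the resulting bytes as a bit string and as hexadecimal. The helper names (bit_string, encode_row) and the mapping from the page's charset labels to Python codec names are assumptions made for illustration, not part of the original tool. A character that a charset cannot represent is replaced by "?" (byte 3f), which is what the "?" entries in the table show. Python's codec tables may differ slightly from the page's converter, so individual rows are not guaranteed to match exactly.

```python
def bit_string(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(codec: str, text: str):
    """Encode text with the given codec and return (shown, bits, hex).

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    # Decode again to see which characters survived the round trip.
    shown = raw.decode(codec, errors="replace")
    return shown, bit_string(raw), raw.hex()


# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def main() -> None:
    text = "紆紺頗繐轤"  # the sample input from the table
    print("Charset\tCharacter\tBit string (binary)\tBit string (hexadecimal)")
    for label, codec in CODECS.items():
        shown, bits, hexa = encode_row(codec, text)
        print(f"{label}\t{shown}\t{bits}\t{hexa}")


if __name__ == "__main__":
    main()
```

Using errors="replace" rather than the default strict mode keeps the script from raising UnicodeEncodeError on the first unsupported character, which mirrors how the table reports partial results for ISO-8859-1, SJIS-WIN, and UHC.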