Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???h	00111111001111110011111101101000	3f3f3f68
SJIS-WIN	罐′鮎h	11100011101000111000000110001100100010001011110001101000	e3a3818c88bc68
EUC-JP	罐′鮎h	11100110101001011010000111101100101100001011111001101000	e6a5a1ecb0be68
UTF-8	罐′鮎h	11100111101111011001000011100010100000001011001011101001101011101000111001101000	e7bd90e280b2e9ae8e68
UHC	罐′鮎h	11001110101110001010000111000111111011111100011001101000	ceb8a1c7efc668

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)