Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???B	00111111001111110011111101000010	3f3f3f42
SJIS-WIN	?癌?B	0011111110001010111000000011111101000010	3f8ae03f42
EUC-JP	?癌啞B	00111111101101001110001010001111101101011110000001000010	3fb4e28fb5e042
UTF-8	卨癌啞B	11100101100011011010100011100111100110011000110011100101100101011001111001000010	e58da8e7998ce5959e42
UHC	卨癌啞B	11100000110110011110010011011111111001001010111101000010	e0d9e4dfe4af42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)