Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???@B	0011111100111111001111110100000001000010	3f3f3f4042
SJIS-WIN	?功ダ@B	00111111100011001111011110000011010111110100000001000010	3f8cf7835f4042
EUC-JP	?功ダ@B	00111111101110001111100110100101110000000100000001000010	3fb8f9a5c04042
UTF-8	룴功ダ@B	1110101110100011101101001110010110001010100111111110001110000011100000000100000001000010	eba3b4e58a9fe383804042
UHC	룴功ダ@B	1000111110101001110011011110110110101011110000000100000001000010	8fa9cdedabc04042
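
The conversion shown in the table can be approximated with a short Python sketch. It is not the tool's actual implementation: the mapping from the table's charset names to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the function names are hypothetical. Characters that a charset cannot represent are replaced with "?", which is how the ISO-8859-1 row ends up as "???@B".

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hexadecimal string)."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Each byte is written as 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the bytes again shows what the charset actually kept.
    shown = raw.decode(codec, errors="replace")
    return shown, bits, raw.hex()


def convert(text):
    """Build one table row per charset for the given input text."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexa) in convert("룴功ダ@B").items():
        print(name, shown, bits, hexa, sep="\t")
```

Each byte contributes exactly 8 binary digits and 2 hexadecimal digits, so the two columns always describe the same byte sequence.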

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN, i.e. CP932) and on UNIX (EUC-JP).