Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?@?@B	0011111101000000001111110100000001000010	3f403f4042
SJIS-WIN	叩@叩@B	10010010010000000100000010010010010000000100000001000010	92404092404042
EUC-JP	叩@叩@B	11000011101000010100000011000011101000010100000001000010	c3a140c3a14042
UTF-8	叩@叩@B	111001011000111110101001010000001110010110001111101010010100000001000010	e58fa940e58fa94042
UHC	叩@叩@B	11001101101100000100000011001101101100000100000001000010	cdb040cdb04042

SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
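
The conversion shown in the table can be sketched in Python as follows. This is a minimal illustration, not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_row are assumptions made for this example. Characters that a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, bit string) for text encoded with codec.

    Hypothetical helper for illustration only.
    """
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return data.hex(), bits


if __name__ == "__main__":
    sample = "叩@叩@B"
    for label, codec in CODECS.items():
        hex_str, bits = encode_row(sample, codec)
        print(label, bits, hex_str, sep="\t")
```

Each output line holds the charset label, the bit string, and the hexadecimal string, separated by tabs. The same character produces a different number of bytes in each charset: three bytes in UTF-8, two in SJIS-WIN and EUC-JP.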