Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?@?j	00111111010000000011111101101010	3f403f6a
SJIS-WIN	叩@叩j	100100100100000001000000100100100100000001101010	92404092406a
EUC-JP	叩@叩j	110000111010000101000000110000111010000101101010	c3a140c3a16a
UTF-8	叩@叩j	1110010110001111101010010100000011100101100011111010100101101010	e58fa940e58fa96a
UHC	叩@叩j	110011011011000001000000110011011011000001101010	cdb040cdb06a

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
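
The table above can be approximated offline with a short Python sketch. This is not the code behind this page; it assumes that Python's standard codecs `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC, and the names `CODECS`, `encode_row`, and `print_table` are made up for illustration. Characters that a charset cannot represent are replaced with `?`, which is why the kanji would show up as `?` in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return shown, bits, raw.hex()


def print_table(text):
    """Print one tab-separated row per charset, mirroring the table layout."""
    print("Charset\tCharacter\tBit string (binary)\tBit string (hexadecimal)")
    for label, codec in CODECS.items():
        shown, bits, hexstr = encode_row(text, codec)
        print(f"{label}\t{shown}\t{bits}\t{hexstr}")


if __name__ == "__main__":
    print_table("叩@叩j")
```

Comparing the hexadecimal column across rows shows how the same four characters occupy a different number of bytes in each charset, and that in SJIS-WIN the second byte of the kanji has the same value as the ASCII `@` that follows it.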