Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	ェ烋	10000011010001101110000001111101	8346e07d
EUC-JP	ェ烋	10100101101001111101111111011110	a5a7dfde
UTF-8	ェ烋	111000111000001010100111111001111000001110001011	e382a7e7838b
UHC	ェ烋	10101011101001111111110111001110	aba7fdce

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)