Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	癬雁	11100001100111011000101011100101	e19d8ae5
EUC-JP	癬雁	11100001111111011011010011100111	e1fdb4e7
UTF-8	癬雁	111001111001100110101100111010011001101110000001	e799ace99b81
UHC	癬雁	11100000110010001110010011010010	e0c8e4d2

SJIS-WIN, EUC-JP: Classic charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
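
The rows in the table above can be reproduced with a short Python sketch. It is not the converter's own code. It assumes that Python's codec names "latin-1", "cp932", "euc_jp", "utf-8" and "cp949" correspond to the charsets ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row. The names CODECS and encode_row are made up for this illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


text = "癬雁"
rows = {label: encode_row(text, codec) for label, codec in CODECS.items()}

for label, (bits, hexstr) in rows.items():
    print(label, bits, hexstr, sep="\t")
```

The same two characters take 4 bytes in SJIS-WIN, EUC-JP and UHC but 6 bytes in UTF-8, and cannot be represented at all in ISO-8859-1.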