Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	澹紮囹	111000000101011111100011010001101001101010011010	e057e3469a9a
EUC-JP	澹紮囹	110111111011100011100101101001111101001111111010	dfb8e5a7d3fa
UTF-8	澹紮囹	111001101011111010111001111001111011010010101110111001011001101110111001	e6beb9e7b4aee59bb9
UHC	澹紮囹	110100111100001011110011110011111101011010110111	d3c2f3cfd6b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)