Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	遡暹	10010001011010111001110111111001	916b9df9
EUC-JP	遡暹	11000001110011001101101011111011	c1ccdafb
UTF-8	遡暹	111010011000000110100001111001101001101010111001	e981a1e69ab9
UHC	遡暹	11100001110011111110000011100111	e1cfe0e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)