Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	癩蘢	11100001100110101110010101100011	e19ae563
EUC-JP	癩蘢	11100001111110101110100111000100	e1fae9c4
UTF-8	癩蘢	111001111001100110101001111010001001100010100010	e799a9e898a2
UHC	癩?	110101001111110100111111	d4fd3f
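
The rows above can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: it assumes that Python's standard codecs `cp932` and `cp949` correspond to the SJIS-WIN and UHC labels, and that characters a charset cannot represent are replaced with `?` (0x3f). The names `CODECS`, `encode_row`, and `charcode_table` are hypothetical helpers introduced for illustration.

```python
TEXT = "\u7669\u8622"  # the two sample characters from the table above

# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def charcode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in charcode_table(TEXT).items():
        print(f"{label}\t{bits}\t{hex_string}")
```

Because of `errors="replace"`, a charset with no code for a character yields the byte 0x3f, which is why the ISO-8859-1 row reads `??` / `3f3f`.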

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).