Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	愁貞愚	100011110100010010010010111001011000101111110000	8f4492e58bf0
EUC-JP	愁貞愚	101111011010010111000100111001111011011011110010	bda5c4e7b6f2
UTF-8	愁貞愚	111001101000010010000001111010001011001010011110111001101000010010011010	e68481e8b29ee6849a
UHC	愁貞愚	111000011111111011101111111101101110100111010111	e1feeff6e9d7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)