Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	蕣伎	11100100111110101000101011101010	e4fa8aea
EUC-JP	蕣伎	11101000111111001011010011101100	e8fcb4ec
UTF-8	蕣伎	111010001001010110100011111001001011110010001110	e895a3e4bc8e
UHC	蕣伎	11100010111100101101000011101011	e2f2d0eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)