Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	讌ｮ彧疲ｦｧ	111001101010010110101110111110101011100110010100111001101010011010100111	e6a5aefab994e6a6a7
EUC-JP	讌ｮ彧疲ｦｧ	11101100101001111000111010101110100011111011110011111110110010001110100010001110101001101000111010100111	eca78eae8fbcfec8e88ea68ea7
UTF-8	讌ｮ彧疲ｦｧ	111010001010111010001100111011111011110110101110111001011011110110100111111001111001011010110010111011111011110110100110111011111011110110100111	e8ae8cefbdaee5bda7e796b2efbda6efbda7
UHC	??彧疲??	0011111100111111111010011110111011111001101010100011111100111111	3f3fe9eef9aa3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)