Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	魍磯＃螟	1110100110110001100010001110100110000001100101001110010110100100	e9b188e98194e5a4
EUC-JP	魍磯＃螟	1111001010110011101100001110101110100001111101001110101010100110	f2b3b0eba1f4eaa6
UTF-8	魍磯＃螟	111010011010110110001101111001111010001110101111111011111011110010000011111010001001111010011111	e9ad8de7a3afefbc83e89e9f
UHC	?磯＃螟	00111111110100011011010010100011101000111101100110101101	3fd1b4a3a3d9ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)