Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	億顎	10001001101011011000101001111011	89ad8a7b
EUC-JP	億顎	10110010101011111011001111011100	b2afb3dc
UTF-8	億顎	111001011000010010000100111010011010000110001110	e58484e9a18e
UHC	億顎	11100101111000101110010011001001	e5e2e4c9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)