Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	蠻隘霧珀	1110010111000101111010001010010110010110101101101110000011011111	e5c5e8a596b6e0df
EUC-JP	蠻隘霧珀	1110101011000111111100001010011111001100101110001110000011100001	eac7f0a7ccb8e0e1
UTF-8	蠻隘霧珀	111010001010000010111011111010011001101010011000111010011001110010100111111001111000111110000000	e8a0bbe99a98e99ca7e78f80
UHC	蠻隘霧珀	1101100011000001111001001111011011011001111101101101101011010110	d8c1e4f6d9f6dad6

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
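
The table above can be approximated offline with a short script. This is a minimal sketch, not the page's own implementation, which is not shown here. It assumes Python's codec names `cp932` and `cp949` correspond to the SJIS-WIN and UHC rows, and that characters a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row. The names `CODECS`, `bit_strings`, and `charcode_table` are hypothetical.

```python
# Sketch: show the bytes each charset produces for the same text.
# The label-to-codec mapping below is an assumption, not taken from the page.
CODECS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def charcode_table(text):
    """Build one (label, binary, hexadecimal) row per charset."""
    rows = []
    for label, codec in CODECS:
        binary, hexadecimal = bit_strings(text, codec)
        rows.append((label, binary, hexadecimal))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in charcode_table("蠻隘霧珀"):
        print(label, binary, hexadecimal, sep="\t")
```

Each byte is printed as eight bits, so the binary column is always four times as long as the hexadecimal one. The same four characters take 8 bytes in the legacy double-byte charsets and 12 bytes in UTF-8.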