Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	堯警?邁	11101010100111111000110001111000001111111110011110110000	ea9f8c783fe7b0
EUC-JP	堯警?邁	11110100101000011011011111011001001111111110111010110010	f4a1b7d93feeb2
UTF-8	堯警렪邁	111001011010000010101111111010001010110110100110111010111010000010101010111010011000001010000001	e5a0afe8ada6eba0aae98281
UHC	堯警렪邁	1110100011101011110011001110110110001110101110001101100011100100	e8ebcced8eb8d8e4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)