Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???AP	0011111100111111001111110100000101010000	3f3f3f4150
SJIS-WIN	炭卒捉AP	1001001001011001100100011011001010010001101010000100000101010000	925991b291a84150
EUC-JP	炭卒捉AP	1100001110111010110000101011010011000010101010100100000101010000	c3bac2b4c2aa4150
UTF-8	炭卒捉AP	1110011110000010101011011110010110001101100100101110011010001101100010010100000101010000	e782ade58d92e68d894150
UHC	炭卒捉AP	1111011110101001111100001110111111110011101101010100000101010000	f7a9f0eff3b54150
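
The rows above can be reproduced with a short script. This is a minimal sketch, not the code behind this page: the helper name describe and the CHARSETS mapping are hypothetical, and the Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumed to be equivalents of the charset labels in the table. Characters that a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The pairing (e.g. SJIS-WIN -> cp932, UHC -> cp949) is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Encode text with codec and return (displayed text, bit string, hex string)."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode the bytes again to get what the charset can actually display.
    shown = raw.decode(codec, errors="replace")
    # Each byte is written as 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def rows(text):
    """Build one tab-separated line per charset, in the table's column order."""
    return ["\t".join((label, *describe(text, codec)))
            for label, codec in CHARSETS.items()]
```

Calling rows("炭卒捉AP") returns one line per charset, and printing them gives output in the same column layout as the table. Only "A" and "P" have the same bytes (41 and 50) in every row, because all five charsets are ASCII-compatible.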

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).