Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???@	00111111001111110011111101000000	3f3f3f40
SJIS-WIN	受△?@	100011101111001110000001101000100011111101000000	8ef381a23f40
EUC-JP	受△?@	101111001111010110100010101001000011111101000000	bcf5a2a43f40
UTF-8	受△뼦@	11100101100011111001011111100010100101101011001111101011101111001010011001000000	e58f97e296b3ebbca640
UHC	受△뼦@	11100001111101001010000111100010100101101010100101000000	e1f4a1e296a940
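
The rows above can be reproduced roughly as follows. This is a minimal sketch, not the page's actual implementation. It assumes the input was "受△뼦@" (inferred from the UTF-8 row), that Python's `cp932`, `euc_jp`, and `cp949` codecs correspond to the SJIS-WIN, EUC-JP, and UHC labels, and that characters a charset cannot represent are replaced with "?". The `charcode_row` helper and the `CODECS` mapping are hypothetical names.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def charcode_row(text, codec):
    """Return (displayed text, binary string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decoding the bytes again shows what survives the round trip.
    shown = raw.decode(codec)
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hex_string = charcode_row("受△뼦@", codec)
        # Only ASCII columns are printed so the output is terminal-safe.
        print(label, bits, hex_string, sep="\t")
```

Calling `charcode_row("受△뼦@", "iso-8859-1")` returns the "???@" text with hex string `3f3f3f40`, because none of the three non-ASCII characters exists in ISO-8859-1.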

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.