Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	UNUN^	100100010101010110010111010011101001000101010101100101110100111001011110	9155974e9155974e5e
SJIS-WIN	?U?N?U?N^	001111110101010100111111010011100011111101010101001111110100111001011110	3f553f4e3f553f4e5e
EUC-JP	?U?N?U?N^	001111110101010100111111010011100011111101010101001111110100111001011110	3f553f4e3f553f4e5e
UTF-8	UNUN^	11000010100100010101010111000010100101110100111011000010100100010101010111000010100101110100111001011110	c29155c2974ec29155c2974e5e
UHC	?U?N?U?N^	001111110101010100111111010011100011111101010101001111110100111001011110	3f553f4e3f553f4e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)