Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	蠍ｸ讖ｲ蝗撮	11100101101101101011100011100110101010011011001011100101100110111000111001000010	e5b6b8e6a9b2e59b8e42
EUC-JP	蠍ｸ讖ｲ蝗撮	111010101011100010001110101110001110110010101011100011101011001011101001111110111011101110100011	eab88eb8ecab8eb2e9fbbba3
UTF-8	蠍ｸ讖ｲ蝗撮	111010001010000010001101111011111011110110111000111010001010111010010110111011111011110110110010111010001001110110010111111001101001001010101110	e8a08defbdb8e8ae96efbdb2e89d97e692ae
UHC	??讖?蝗撮	001111110011111111110011110110010011111111111100110110011111010111001001	3f3ff3d93ffcd9f5c9

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
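
A table like the one above can be approximated with a few lines of Python. This is only a sketch: the function name encode_table is hypothetical, and the Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumed to correspond to the charset labels used on this page. The converter itself may be implemented differently, so edge cases may not match. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row.

```python
# Assumed mapping from this page's charset labels to Python codec names.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return (charset, round-tripped text, bit string, hex string) per charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(name, errors="replace")
        # Each byte becomes eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, raw.decode(name), bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for charset, shown, bits, hexstr in encode_table("あ"):
        print(charset, shown, bits, hexstr, sep="\t")
```

The same character produces a different byte sequence in each charset, which is why bytes decoded with the wrong charset turn into unreadable text.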