Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????M	0011111100111111001111110011111101001101	3f3f3f3f4d
SJIS-WIN	嶸ｵ嶸ｯM	11111010101101001011010111111010101101001010111101001101	fab4b5fab4af4d
EUC-JP	嶸ｵ嶸ｯM	1000111110111011111101001000111010110101100011111011101111110100100011101010111101001101	8fbbf48eb58fbbf48eaf4d
UTF-8	嶸ｵ嶸ｯM	11100101101101101011100011101111101111011011010111100101101101101011100011101111101111011010111101001101	e5b6b8efbdb5e5b6b8efbdaf4d
UHC	嶸?嶸?M	11100111101011100011111111100111101011100011111101001101	e7ae3fe7ae3f4d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)