Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????B	0011111100111111001111110011111101000010	3f3f3f3f42
SJIS-WIN	予?予?B	10010111010111000011111110010111010111000011111101000010	975c3f975c3f42
EUC-JP	予?予?B	11001101101111010011111111001101101111010011111101000010	cdbd3fcdbd3f42
UTF-8	予럝予럝B	11100100101110101000100011101011100111111001110111100100101110101000100011101011100111111001110101000010	e4ba88eb9f9de4ba88eb9f9d42
UHC	予럝予럝B	111001011111100010001110011110101110010111111000100011100111101001000010	e5f88e7ae5f88e7a42

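SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is replaced with "?" (0x3f), as in the rows above.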
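
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the function name `describe` and the constant `CHARSETS` are hypothetical, and it assumes that Python's codec names `cp932`, `euc_jp`, and `cp949` correspond to the table's SJIS-WIN, EUC-JP, and UHC rows. Characters a charset cannot represent are replaced with "?" via `errors="replace"`, matching the behavior visible in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The correspondence is an assumption, not something stated by the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Encode text with codec and return (binary bit string, hex string)."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "予럝予럝B"  # the input used for the table above
    for label, codec in CHARSETS.items():
        bits, hex_string = describe(sample, codec)
        print(label, bits, hex_string, sep="\t")
```

Each encoded byte is rendered as eight binary digits and two hexadecimal digits, so the bit string is always four times as long as the hex string.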