Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	鉤ｹ荳委	11100111111010101011100111100100101110001000100011001111	e7eab9e4b888cf
EUC-JP	鉤ｹ荳委	1110111011101100100011101011100111101000101110101011000011010001	eeec8eb9e8bab0d1
UTF-8	鉤ｹ荳委	111010011000100110100100111011111011110110111001111010001000110110110011111001011010011110010100	e989a4efbdb9e88db3e5a794
UHC	鉤?荳委	11001111110010010011111111010100111001011110101011001101	cfc93fd4e5eacd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)