Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	鉤蛾ｧ亥竅闍	1110011111101010100010011110100110100111100010001110010111100010100000011110100010001011	e7ea89e9a788e5e281e88b
EUC-JP	鉤蛾ｧ亥竅闍	111011101110110010110010111010111000111010100111101100001110011111100011111000011110111111101011	eeecb2eb8ea7b0e7e3e1efeb
UTF-8	鉤蛾ｧ亥竅闍	111010011000100110100100111010001001101110111110111011111011110110100111111001001011101010100101111001111010101110000101111010011001011110001101	e989a4e89bbeefbda7e4baa5e7ab85e9978d
UHC	鉤蛾?亥竅?	11001111110010011110010010110110001111111111101010100100110100001010101100111111	cfc9e4b63ffaa4d0ab3f

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
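
A table like the one above can be approximated with a short Python sketch. This is not the page's own implementation. The Python codec names chosen for each charset label (for example cp932 for SJIS-WIN and cp949 for UHC) and the helper names encode_row and charcode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which mirrors the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (character as stored, bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def charcode_table(text):
    """Build one row per charset: (label, character, binary, hexadecimal)."""
    return [(label, *encode_row(text, codec)) for label, codec in CHARSETS.items()]


if __name__ == "__main__":
    # Print a tab-separated table for a sample character.
    for row in charcode_table("あ"):
        print("\t".join(row))
```

Byte-level results can differ between conversion libraries for a few vendor-specific characters, so the output may not match the table above in every case.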