Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	ï§??	11101111101001110011111100111111	efa73f3f
SJIS-WIN	?§‡蹕	00111111100000011001100010000001111101101110011101001010	3f819881f6e74a
EUC-JP	ï§‡蹕	100011111010101111000001101000011111100010100010111110001110110110101011	8fabc1a1f8a2f8edab
UTF-8	ï§‡蹕	11000011101011111100001010100111111000101000000010100001111010001011100110010101	c3afc2a7e280a1e8b995
UHC	?§‡?	001111111010000111010111101000101101010000111111	3fa1d7a2d43f

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
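
A conversion of this kind can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes Python's standard codecs `cp932` and `cp949` are reasonable stand-ins for SJIS-WIN and UHC, and that a character with no mapping in a charset is replaced by `?` (byte 3f), as the table suggests. The names `SAMPLE`, `to_bits`, and `describe` are hypothetical helpers introduced for illustration.

```python
# Sketch: encode one string in several charsets and show the resulting
# bytes as a binary bit string and as hexadecimal.

SAMPLE = "ï§‡蹕"  # the sample string from the table above

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data):
    """Return the bytes as one string of 0s and 1s, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def describe(text, codec):
    """Encode text with the given codec; return (bit string, hex string).

    Characters the charset cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    return to_bits(raw), raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexa = describe(SAMPLE, codec)
        print(label, bits, hexa, sep="\t")
```

Legacy codec tables differ slightly between implementations, so the output for SJIS-WIN, EUC-JP, and UHC may not match the table byte for byte in every case.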