Character and Character Code - Check How Computers Recognize Characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	霓､蟇よ純	111010001011110110100100111001011010111110000010111001101000111110000011	e8bda4e5af82e68f83
EUC-JP	霓､蟇よ純	11110000101111111000111010100100111010101011000110100100111010001011110111100011	f0bf8ea4eab1a4e8bde3
UTF-8	霓､蟇よ純	111010011001110010010011111011111011110110100100111010001001111110000111111000111000001010001000111001111011010010010100	e99c93efbda4e89f87e38288e7b494
UHC	霓??よ純	1110011111100111001111110011111110101010111010001110001011101101	e7e73f3faae8e2ed

SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A "?" (hexadecimal 3f) in a row marks a character that the charset cannot represent.
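
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the page's actual implementation: the helper name `describe` is hypothetical, and the Python codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to be the closest counterparts of the labels above. Characters a charset cannot represent are replaced with "?", as in the table; results for the legacy charsets may differ slightly between libraries.

```python
def describe(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    Characters the charset cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per encoded byte
    return bits, data.hex()


# Table label -> Python codec name (the mapping is an assumption, see above).
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]

for label, codec in CHARSETS:
    bits, hexadecimal = describe("霓､蟇よ純", codec)
    print(label, bits, hexadecimal, sep="\t")
```

Each output row has the same columns as the table, minus the Character column: charset, bit string, hexadecimal string.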