Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	竣紺信檍?	100011110111011010001101101011101001000001001101100111101111100000111111	8f768dae904d9ef83f
EUC-JP	竣紺信檍?	101111011101011110111010101100001011111110101110110111001111101000111111	bdd7bab0bfaedcfa3f
UTF-8	竣紺信檍뻬	111001111010101110100011111001111011010010111010111001001011111110100001111001101010101010001101111010111011101110101100	e7aba3e7b4bae4bfa1e6aa8debbbac
UHC	竣紺信檍뻬	11110001111000101100101011111010111000111110000111100101111001011011101110111110	f1e2cafae3e1e5e5bbbe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)