Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????^	0011111100111111001111110011111101011110	3f3f3f3f5e
SJIS-WIN	魯斛魯?^	1001100001000100100111011100110110011000010001000011111101011110	98449dcd98443f5e
EUC-JP	魯斛魯?^	1100111110100101110110101100111111001111101001010011111101011110	cfa5dacfcfa53f5e
UTF-8	魯斛魯씹^	11101001101011011010111111100110100101101001101111101001101011011010111111101100100101001011100101011110	e9adafe6969be9adafec94b95e
UHC	魯斛魯씹^	110101101101101111001101110101111101011011011011101111101100001101011110	d6dbcdd7d6dbbec35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)