Character and Charcode - Check how computers recognize characters

Enter a single character or a short string, then click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	闡ｯ蜀礼強	111010001001000110101111111001011000011010010111111001111000101110101101	e891afe58697e78bad
EUC-JP	闡ｯ蜀礼強	11101111111100011000111010101111111010011110011011001110111010011011011010101111	eff18eafe9e6cee9b6af
UTF-8	闡ｯ蜀礼強	111010011001011110100001111011111011110110101111111010001001110010000000111001111010010010111100111001011011110010110111	e997a1efbdafe89c80e7a4bce5bcb7
UHC	闡?蜀??	11110100110001010011111111110101101110010011111100111111	f4c53ff5b93f3f

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
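
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the helper names `to_bits` and `encode_row` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex `3f`), as in the ISO-8859-1 row; for other charsets Python's output may differ in detail from the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one continuous string of 0s and 1s, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, charset: str) -> tuple[str, str]:
    """Encode text in the given charset and return (binary string, hex string).

    Unencodable characters become '?' instead of raising an error.
    """
    data = text.encode(CODECS[charset], errors="replace")
    return to_bits(data), data.hex()


if __name__ == "__main__":
    sample = "闡ｯ蜀礼強"  # the input string shown in the table
    for charset in CODECS:
        bits, hex_string = encode_row(sample, charset)
        print(charset, bits, hex_string, sep="\t")
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.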