Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????h?	001111110011111100111111001111110110100000111111	3f3f3f3f683f
SJIS-WIN	錮滂ｽ･h錮	111001111111110010011111111011111011110110100101011010001110011111111100	e7fc9fefbda568e7fc
EUC-JP	錮滂ｽ･h錮	1110111011111110110111101111000110001110101111011000111010100101011010001110111011111110	eefedef18ebd8ea568eefe
UTF-8	錮滂ｽ･h錮	11101001100011001010111011100110101110111000001011101111101111011011110111101111101111011010010101101000111010011000110010101110	e98caee6bb82efbdbdefbda568e98cae
UHC	錮滂??h錮	110011011101000111011011101101010011111100111111011010001100110111010001	cdd1dbb53f3f68cdd1

SJIS-Win, EUC-JP: Legacy Japanese character encodings. SJIS-Win (CP932) is mainly used on Windows, and EUC-JP is mainly used on UNIX.
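
The conversion shown in the table can be approximated in a few lines of Python. This is a minimal sketch, not the code behind this page: the helper name `char_codes` and the `CODECS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed to correspond to the charset labels above. Characters a charset cannot represent are replaced with `?` (hex 3f), which matches the `?` entries in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def char_codes(text, codec):
    """Return (binary bit string, hexadecimal string) of text encoded with codec.

    Characters the codec cannot represent are replaced with "?".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    # Print one tab-separated row per charset, like the table above.
    for label, codec in CODECS.items():
        bits, hex_string = char_codes("h", codec)
        print(label, "h", bits, hex_string, sep="\t")
```

The letter "h" encodes to the single byte 68 in every charset listed, which is why it appears unchanged in the middle of each row of the table.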