Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???B	00111111001111110011111101000010	3f3f3f42
SJIS-WIN	仲脛?B	100100101000011111100011111110000011111101000010	9287e3f83f42
EUC-JP	仲脛?B	110000111110011111100110111110100011111101000010	c3e7e6fa3f42
UTF-8	仲脛놔B	11100100101110111011001011101000100001001001101111101011100001101001010001000010	e4bbb2e8849beb869442
UHC	仲脛놔B	11110001111010101100110011101011101100111111011001000010	f1eaccebb3f642

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A "?" (3f) in a row marks a character that the charset cannot represent.
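
The table can be approximated offline with a minimal Python sketch. The helper name charcode_rows and the CHARSETS mapping are hypothetical, and the codec names are Python's own, assumed here to correspond to the charsets listed (cp932 for SJIS-WIN, cp949 for UHC). The converter behind this page may use different mapping tables, so individual byte values could differ. Characters a charset cannot represent are replaced with "?", as in the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def charcode_rows(text):
    """Return (charset, character, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Each byte is written as eight zero-padded binary digits.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


# Print only the ASCII columns so the output is safe on any terminal.
for label, _chars, bits, hexa in charcode_rows("仲脛놔B"):
    print(label, bits, hexa, sep="\t")
```

For the sample input, the UTF-8 row of this sketch yields e4bbb2e8849beb869442 and the ISO-8859-1 row yields 3f3f3f42, matching the table above.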