Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????B	0011111100111111001111110011111101000010	3f3f3f3f42
SJIS-WIN	仙昔仙誓B	100100001110010110010000110011001001000011100101100100001011111001000010	90e590cc90e590be42
EUC-JP	仙昔仙誓B	110000001110011111000000110011101100000011100111110000001100000001000010	c0e7c0cec0e7c0c042
UTF-8	仙昔仙誓B	11100100101110111001100111100110100110001001010011100100101110111001100111101000101010101001001101000010	e4bb99e69894e4bb99e8aa9342
UHC	仙昔仙誓B	111000001011100111100000101011101110000010111001111000001010010101000010	e0b9e0aee0b9e0a542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)