Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????B	0011111100111111001111110011111101000010	3f3f3f3f42
SJIS-WIN	昔昔昔昔B	100100001100110010010000110011001001000011001100100100001100110001000010	90cc90cc90cc90cc42
EUC-JP	昔昔昔昔B	110000001100111011000000110011101100000011001110110000001100111001000010	c0cec0cec0cec0ce42
UTF-8	昔昔昔昔B	11100110100110001001010011100110100110001001010011100110100110001001010011100110100110001001010001000010	e69894e69894e69894e6989442
UHC	昔昔昔昔B	111000001010111011100000101011101110000010101110111000001010111001000010	e0aee0aee0aee0ae42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)