Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??h??	0011111100111111011010000011111100111111	3f3f683f3f
SJIS-WIN	雍広h雍広	111010001011010010001101010011000110100011101000101101001000110101001100	e8b48d4c68e8b48d4c
EUC-JP	雍広h雍広	111100001011011010111001101011010110100011110000101101101011100110101101	f0b6b9ad68f0b6b9ad
UTF-8	雍広h雍広	11101001100110111000110111100101101110101000001101101000111010011001101110001101111001011011101010000011	e99b8de5ba8368e99b8de5ba83
UHC	雍?h雍?	11101000101111000011111101101000111010001011110000111111	e8bc3f68e8bc3f

SJIS-WIN, EUC-JP: Classic character sets used mainly for encoding Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
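
The conversion shown in the table above can be approximated with Python's standard codecs. This is a minimal sketch, not the tool's actual implementation: the names `CHARSETS`, `encode_row`, and `charcode_table` are hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f) via `errors="replace"`, which matches what the ISO-8859-1 and UHC rows appear to show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text and return (raw bytes, binary bit string, hex string)."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Each byte is written as exactly 8 bits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return raw, binary, raw.hex()


def charcode_table(text):
    """Build one (label, displayed text, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        raw, binary, hexadecimal = encode_row(text, codec)
        # Decode the bytes again to show what survives the round trip.
        displayed = raw.decode(codec)
        rows.append((label, displayed, binary, hexadecimal))
    return rows


if __name__ == "__main__":
    # Print only the ASCII-safe columns so the output works on any console.
    for label, _displayed, _binary, hexadecimal in charcode_table("雍広h雍広"):
        print(f"{label}\t{hexadecimal}")
```

Calling `charcode_table("h")` returns the same bit string, `01101000`, for every charset listed, because all five encode ASCII letters identically; the rows differ only for characters outside ASCII.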