Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???C	00111111001111110011111101000011	3f3f3f43
SJIS-WIN	題?緋C	100100011110100000111111100101001110101001000011	91e83f94ea43
EUC-JP	題?緋C	110000101110101000111111110010001110110001000011	c2ea3fc8ec43
UTF-8	題띄緋C	11101001101000011000110011101011100111011000010011100111101101111000101101000011	e9a18ceb9d84e7b78b43
UHC	題띄緋C	11110000101110011011011011100111110111011111110001000011	f0b9b6e7ddfc43

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text: SJIS-WIN (= CP932) on Windows and EUC-JP on UNIX.
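
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation. The codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper names `bit_strings` and `convert` are assumptions made for illustration. The 3f bytes in the table look consistent with replacing each unrepresentable character with "?", so the sketch uses errors="replace" to do the same.

```python
# Sketch: encode a short string in several charsets and show the
# resulting bytes as binary and hexadecimal bit strings.

# Assumption: these Python codec names stand in for the charset
# labels used in the table (SJIS-WIN -> cp932, UHC -> cp949).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (raw bytes, binary string, hex string) for text in codec."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return raw, binary, raw.hex()


def convert(text):
    """Build one (charset, character, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        raw, binary, hexa = bit_strings(text, codec)
        # Decode the bytes again to show what survives the round trip.
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, binary, hexa))
    return rows


if __name__ == "__main__":
    for row in convert("題띄緋C"):
        print("\t".join(row))
```

Each byte is printed as eight binary digits, so a two-byte character contributes 16 bits and a three-byte UTF-8 character contributes 24 bits to the bit string.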