Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??C??C^	00111111001111110100001100111111001111110100001101011110	3f3f433f3f435e
SJIS-WIN	鶯痔C鶯痔C^	1110100111110010100011101010010001000011111010011111001010001110101001000100001101011110	e9f28ea443e9f28ea4435e
EUC-JP	鶯痔C鶯痔C^	1111001011110100101111001010011001000011111100101111010010111100101001100100001101011110	f2f4bca643f2f4bca6435e
UTF-8	鶯痔C鶯痔C^	111010011011011010101111111001111001011110010100010000111110100110110110101011111110011110010111100101000100001101011110	e9b6afe7979443e9b6afe79794435e
UHC	鶯痔C鶯痔C^	1110010110100011111101101100000001000011111001011010001111110110110000000100001101011110	e5a3f6c043e5a3f6c0435e

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
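
The rows in the table above can be approximated offline with a short Python sketch. This is not the converter's own code. The codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumed mappings to Python's built-in codecs, and char_codes is a hypothetical helper name. Characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def char_codes(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unrepresentable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in char_codes("鶯痔C鶯痔C^").items():
        print(label, bits, hex_string, sep="\t")
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.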