Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	邪糘症ﾋｼ	1000111011010111111000101111001010001111110001111100101110111100	8ed7e2f28fc7cbbc
EUC-JP	邪糘症ﾋｼ	10111100110110011110010011110100101111101100100110001110110010111000111010111100	bcd9e4f4bec98ecb8ebc
UTF-8	邪糘症ﾋｼ	111010011000001010101010111001111011001110011000111001111001011110000111111011111011111010001011111011111011110110111100	e982aae7b398e79787efbe8befbdbc
UHC	邪?症??	11011110111101110011111111110001111110000011111100111111	def73ff1f83f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)