Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????B	00111111001111110011111100111111001111110011111101000010	3f3f3f3f3f3f42
SJIS-WIN	邪糘邪糘B	10001110110101111110001011110010111100011101101110001110110101111110001011110010111100011101101101000010	8ed7e2f2f1db8ed7e2f2f1db42
EUC-JP	邪糘?邪糘?B	1011110011011001111001001111010000111111101111001101100111100100111101000011111101000010	bcd9e4f43fbcd9e4f43f42
UTF-8	邪糘邪糘B	11101001100000101010101011100111101100111001100011101110100001011001011011101001100000101010101011100111101100111001100011101110100001011001011001000010	e982aae7b398ee8596e982aae7b398ee859642
UHC	邪??邪??B	110111101111011100111111001111111101111011110111001111110011111101000010	def73f3fdef73f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)