Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????B	0011111100111111001111110011111100111111001111110011111101000010	3f3f3f3f3f3f3f42
SJIS-WIN	邪糅頸贓糘遮B	100011101101011111100010111100001110100011110010111001101101100111100010111100101111001011100010100011101101010101000010	8ed7e2f0e8f2e6d9e2f2f2e28ed542
EUC-JP	邪糅頸贓糘?遮B	1011110011011001111001001111001011110000111101001110110011011011111001001111010000111111101111001101011101000010	bcd9e4f2f0f4ecdbe4f43fbcd742
UTF-8	邪糅頸贓糘遮B	11101001100000101010101011100111101100111000010111101001101000001011100011101000101101001001001111100111101100111001100011101110100010001001100111101001100000011010111001000010	e982aae7b385e9a0b8e8b493e7b398ee8899e981ae42
UHC	邪?頸贓??遮B	110111101111011100111111110011001111001011101101111111000011111100111111111100111011010001000010	def73fccf2edfc3f3ff3b442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)