Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??i??iB	00111111001111110110100100111111001111110110100101000010	3f3f693f3f6942
SJIS-WIN	捨蛇i捨蛇iB	1000111011001100100011101101011001101001100011101100110010001110110101100110100101000010	8ecc8ed6698ecc8ed66942
EUC-JP	捨蛇i捨蛇iB	1011110011001110101111001101100001101001101111001100111010111100110110000110100101000010	bccebcd869bccebcd86942
UTF-8	捨蛇i捨蛇iB	111001101000110110101000111010001001101110000111011010011110011010001101101010001110100010011011100001110110100101000010	e68da8e89b8769e68da8e89b876942
UHC	捨蛇i捨蛇iB	1101111011010111110111101110111101101001110111101101011111011110111011110110100101000010	ded7deef69ded7deef6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)