Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	閠檎ｴ勇	11101000100000001000110011100111101101001001011101000101	e8808ce7b49745
EUC-JP	閠檎ｴ勇	1110111111100000101110001110100110001110101101001100110110100110	efe0b8e98eb4cda6
UTF-8	閠檎ｴ勇	111010011001011010100000111001101010101010001110111011111011110110110100111001011000101110000111	e996a0e6aa8eefbdb4e58b87
UHC	?檎?勇	001111111101000011010101001111111110100110111000	3fd0d53fe9b8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)