Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	贖鰹修茱	1110011011011100100010101000111110001111010000111110010010100011	e6dc8a8f8f43e4a3
EUC-JP	贖鰹修茱	1110110011011110101100111110111110111101101001001110100010100101	ecdeb3efbda4e8a5
UTF-8	贖鰹修茱	111010001011010010010110111010011011000010111001111001001011111110101110111010001000110010110001	e8b496e9b0b9e4bfaee88cb1
UHC	贖?修茱	11100001110110110011111111100001111100111110001010111100	e1db3fe1f3e2bc

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
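
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `char_codes` and the `CHARSETS` mapping are made up for illustration, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed to correspond to the charsets listed above. Characters a charset cannot represent are replaced with "?", which appears to be how the ISO-8859-1 row came out as 3f3f3f3f.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def char_codes(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: unencodable characters become "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexa = char_codes("贖鰹修茱", codec)
        print(label, bits, hexa, sep="\t")
```

Results for the legacy charsets depend on the mapping tables shipped with the Python build, so they may not match the page byte for byte.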