Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	茖種州藪	1110010010100001100011101110110110001111010000101110010101001101	e4a18eed8f42e54d
EUC-JP	茖種州藪	1110100010100011101111001110111110111101101000111110100110101110	e8a3bcefbda3e9ae
UTF-8	茖種州藪	111010001000110010010110111001111010100010101110111001011011011110011110111010001001011110101010	e88c96e7a8aee5b79ee897aa
UHC	?種州藪	00111111111100001111101011110001101101101110001010111111	3ff0faf1b6e2bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)