Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???^	00111111001111110011111101011110	3f3f3f5e
SJIS-WIN	豌旧岡^	11100110101100001000101110001100100010011010101001011110	e6b08b8c89aa5e
EUC-JP	豌旧岡^	11101100101100101011010111101100101100101010110001011110	ecb2b5ecb2ac5e
UTF-8	豌旧岡^	11101000101100011000110011100110100101111010011111100101101100101010000101011110	e8b18ce697a7e5b2a15e
UHC	豌?岡^	111010001101010100111111110010111010101001011110	e8d53fcbaa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)