Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???G?	0011111100111111001111110100011100111111	3f3f3f473f
SJIS-WIN	苟器什G藪	111001001001000110001010111011011000111101011001010001111110010101001101	e4918aed8f5947e54d
EUC-JP	苟器什G藪	111001111111000110110100111011111011110110111010010001111110100110101110	e7f1b4efbdba47e9ae
UTF-8	苟器什G藪	11101000100010111001111111100101100110011010100011100100101110111000000001000111111010001001011110101010	e88b9fe599a8e4bb8047e897aa
UHC	苟器什G藪	110011111100001011010000111011111110010010100111010001111110001010111111	cfc2d0efe4a747e2bf

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
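
The rows in the table above can be approximated offline with a short Python sketch. The codec names (`latin-1`, `cp932`, `euc_jp`, `cp949`) are assumptions about which Python codecs correspond to the tool's charset labels, and `encode_row` is a hypothetical helper, not part of the tool. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with the given codec.

    Unencodable characters are replaced with "?" instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "苟器什G藪"  # the input used in the table
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string, sep="\t")
```

Each output line follows the table's column order: charset, input, binary bit string, hexadecimal bit string. Multi-byte charsets produce two or three bytes per kanji, while the ASCII letter "G" stays a single byte (0x47) in every row.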