Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????h	001111110011111100111111001111110011111101101000	3f3f3f3f3f68
SJIS-WIN	蠅ｺﾑ作旺h	111001011010001010111010110100011000110111101100100010011010000001101000	e5a2bad18dec89a068
EUC-JP	蠅ｺﾑ作旺h	1110101010100100100011101011101010001110110100011011101011101110101100101010001001101000	eaa48eba8ed1baeeb2a268
UTF-8	蠅ｺﾑ作旺h	11101000101000001000010111101111101111011011101011101111101111101001000111100100101111011001110011100110100101111011101001101000	e8a085efbdbaefbe91e4bd9ce697ba68
UHC	蠅??作旺h	111000111011001000111111001111111110110111000010111010001101101001101000	e3b23f3fedc2e8da68

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
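
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The helper name to_bits and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which resembles what the ISO-8859-1 row shows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "\u3042h"  # HIRAGANA LETTER A followed by ASCII "h"
    for label, codec in CODECS.items():
        bits, hexed = to_bits(sample, codec)
        print(f"{label}\t{bits}\t{hexed}")
```

The ASCII letter "h" encodes to the single byte 68 in every charset listed, while the Japanese character takes a different byte sequence (and a different number of bytes) in each one.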