Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	wßúA	1000111101110111110111111111101001000001	8f77dffa41
SJIS-WIN	?w??A	0011111101110111001111110011111101000001	3f773f3f41
EUC-JP	?wßúA	001111110111011110001111101010011100111010001111101010111110001001000001	3f778fa9ce8fabe241
UTF-8	wßúA	1100001010001111011101111100001110011111110000111011101001000001	c28f77c39fc3ba41
UHC	?wß?A	001111110111011110101001101011000011111101000001	3f77a9ac3f41

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)