Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???S???W	0011111100111111001111110101001100111111001111110011111101010111	3f3f3f533f3f3f57
SJIS-WIN	???S??咐W	001111110011111100111111010100110011111100111111100110011111001101010111	3f3f3f533f3f99f357
EUC-JP	???S??咐W	001111110011111100111111010100110011111100111111110100101111010101010111	3f3f3f533f3fd2f557
UTF-8	룶엌∼S룶웩咐W	1110101110100011101101101110110010010111100011001110001010001000101111000101001111101011101000111011011011101100100110111010100111100101100100101001000001010111	eba3b6ec978ce288bc53eba3b6ec9ba9e5929057
UHC	룶엌∼S룶웩咐W	1000111110101011101111101111110110100001101011010101001110001111101010111100000010100001110111001111101101010111	8fabbefda1ad538fabc0a1dcfb57

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)