Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?W?WF	0011111101010111001111110101011101000110	3f573f5746
SJIS-WIN	巽W巽WF	10010010010001100101011110010010010001100101011101000110	92465792465746
EUC-JP	巽W巽WF	11000011101001110101011111000011101001110101011101000110	c3a757c3a75746
UTF-8	巽W巽WF	111001011011011110111101010101111110010110110111101111010101011101000110	e5b7bd57e5b7bd5746
UHC	巽W巽WF	11100001110111100101011111100001110111100101011101000110	e1de57e1de5746

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)