Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	聰ｋ．蟯	1110001111100000100000101000101110000001010001001110010110110010	e3e0828b8144e5b2
EUC-JP	聰ｋ．蟯	1110011011100010101000111110101110100001101001011110101010110100	e6e2a3eba1a5eab4
UTF-8	聰ｋ．蟯	111010001000000110110000111011111011110110001011111011111011110010001110111010001001111110101111	e881b0efbd8befbc8ee89faf
UHC	聰ｋ．蟯	1111010111000110101000111110101110100011101011101110100110101000	f5c6a3eba3aee9a8
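
The rows above can be reproduced offline with a minimal Python sketch. The mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and `encode_row` is a hypothetical helper, not part of this page. Characters that a charset cannot represent are replaced with "?" (0x3f), which is why the ISO-8859-1 row shows only 3f bytes.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    sample = "聰ｋ．蟯"
    for label, codec in CODECS.items():
        bits, hexed = encode_row(sample, codec)
        print(label, sample, bits, hexed, sep="\t")
```

Run as a script, this prints one tab-separated row per charset. The byte values should correspond to the table above as long as Python's codec tables agree with the ones the converter uses.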

SJIS-Win, EUC-JP: Legacy Japanese encodings, used mainly on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).