Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	碩茗碩	100100001101011111100100101010101001000011010111	90d7e4aa90d7
EUC-JP	碩茗碩	110000001101100111101000101011001100000011011001	c0d9e8acc0d9
UTF-8	碩茗碩	111001111010001010101001111010001000110010010111111001111010001010101001	e7a2a9e88c97e7a2a9
UHC	碩茗碩	111000001011010111011001101010111110000010110101	e0b5d9abe0b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)