Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	鏞?	111110111110001100111111	fbe33f
EUC-JP	鏞湄	100011111110010111001001100011111100011111110001	8fe5c98fc7f1
UTF-8	鏞湄	111010011000111110011110111001101011100110000100	e98f9ee6b984
UHC	鏞湄	11101001110010111101101010110101	e9cbdab5

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
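
A conversion like the one in the table can be sketched in a few lines of Python. This is not the page's own implementation, only an illustration: the helper name describe and the mapping from the table's charset labels to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumptions, and Python's mapping tables may not match the page's byte for byte in every charset. Characters a charset cannot represent are replaced with "?" (0x3f), which is the likely source of the "?" entries above.

```python
# Minimal sketch: encode a string in several charsets and show the
# resulting bytes as a bit string and as hexadecimal.
# The codec names below are assumed Python equivalents of the labels
# used in the table, not names taken from the page itself.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (round-tripped text, binary string, hex string) for text."""
    # errors="replace" substitutes "?" for unrepresentable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decode again to show what the charset actually preserved.
    shown = raw.decode(codec, errors="replace")
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "鏞湄"
    for label, codec in CHARSETS.items():
        shown, bits, hexstr = describe(sample, codec)
        print(f"{label}\t{shown}\t{bits}\t{hexstr}")
```

Each output row has the same four tab-separated columns as the table: charset, the character as that charset preserved it, the bit string, and the hexadecimal bytes.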