Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???BM	0011111100111111001111110100001001001101	3f3f3f424d
SJIS-WIN	晶ｪBM	10001111101110111111010010001110101010100100001001001101	8fbbf48eaa424d
EUC-JP	晶?ｪBM	10111110101111010011111110001110101010100100001001001101	bebd3f8eaa424d
UTF-8	晶ｪBM	1110011010011001101101101110111010001100101111011110111110111101101010100100001001001101	e699b6ee8cbdefbdaa424d
UHC	晶??BM	111011111101110000111111001111110100001001001101	efdc3f3f424d

SJIS-Win, EUC-JP: Legacy character encodings used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
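
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) and the helper names to_bits and convert are assumptions, and characters that a charset cannot represent are replaced with "?" (3f), which appears to be what the table shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one tab-separated row per charset, similar to the table.
    for label, (binary, hexadecimal) in convert("晶BM").items():
        print(label, binary, hexadecimal, sep="\t")
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal one.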