Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	淼ﾄ靈	11111011010001011100010011101000110010111111000111101101	fb45c4e8cbf1ed
EUC-JP	淼ﾄ靈?	1000111111000111111001101000111011000100111100001100110100111111	8fc7e68ec4f0cd3f
UTF-8	淼ﾄ靈	111001101011011110111100111011111011111010000100111010011001110110001000111011101000010110101000	e6b7bcefbe84e99d88ee85a8
UHC	??靈?	0011111100111111110101101100010000111111	3f3fd6c43f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)