Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????^	0011111100111111001111110011111101011110	3f3f3f3f5e
SJIS-WIN	?宋??^	001111111001000101110110001111110011111101011110	3f91763f3f5e
EUC-JP	侁宋侁蓀^	100011111011000011111100110000011101011110001111101100001111110010001111110110001111100001011110	8fb0fcc1d78fb0fc8fd8f85e
UTF-8	侁宋侁蓀^	11100100101111101000000111100101101011101000101111100100101111101000000111101000100100111000000001011110	e4be81e5ae8be4be81e893805e
UHC	侁宋侁蓀^	111000111110000011100001111001001110001111100000111000011110000001011110	e3e0e1e4e3e0e1e05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)