Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	窈??瑤??	1110001001110111001111110011111111101010101000100011111100111111	e2773f3feaa23f3f
EUC-JP	窈??瑤??	1110001111011000001111110011111111110100101001000011111100111111	e3d83f3ff4a43f3f
UTF-8	窈뤹걀瑤욤뫍	111001111010101010001000111010111010010010111001111010101011000110000000111001111001000110100100111011001001101010100100111010111010101110001101	e7aa88eba4b9eab180e791a4ec9aa4ebab8d
UHC	窈뤹걀瑤욤뫍	111010011010000110001111111001111011000010111111111010001111110110111111111010001001000110101111	e9a18fe7b0bfe8fdbfe891af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)