Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	薔≫侵纓雁	11100101010010111000000111100010100100000100111011100011100101111000101011100101	e54b81e2904ee3978ae5
EUC-JP	薔≫侵纓雁	11101001101011001010001011100100101111111010111111100101111101111011010011100111	e9aca2e4bfafe5f7b4e7
UTF-8	薔≫侵纓雁	111010001001011010010100111000101000100110101011111001001011111010110101111001111011101010010011111010011001101110000001	e89694e289abe4beb5e7ba93e99b81
UHC	薔≫侵纓雁	11101101111110011010000111101101111101101101010111100111110001011110010011010010	edf9a1edf6d5e7c5e4d2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)