Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????\	001111110011111100111111001111110011111101011100	3f3f3f3f3f5c
SJIS-WIN	??莎パ’\	001111110011111111100100101100111000001101110000100000010110011001011100	3f3fe4b3837081665c
EUC-JP	??莎パ’\	001111110011111111101000101101011010010111010001101000011100011101011100	3f3fe8b5a5d1a1c75c
UTF-8	룴가莎パ’\	11101011101000111011010011101010101100001000000011101000100011101000111011100011100000111001000111100010100000001001100101011100	eba3b4eab080e88e8ee38391e280995c
UHC	룴가莎パ’\	1000111110101001101100001010000111011110111011011010101111010001101000011010111101011100	8fa9b0a1deedabd1a1af5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)