To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 鷄憬熔鷄憬熔^ 11101010010100001001110011011011100101110110111111101010010100001001110011011011100101110110111101011110 ea509cdb976fea509cdb976f5e
EUC-JP 鷄憬熔鷄憬熔^ 11110011101100011101100011011101110011011101000011110011101100011101100011011101110011011101000001011110 f3b1d8ddcdd0f3b1d8ddcdd05e
UTF-8 鷄憬熔鷄憬熔^ 11101001101101111000010011100110100001101010110011100111100001101001010011101001101101111000010011100110100001101010110011100111100001101001010001011110 e9b784e686ace78694e9b784e686ace786945e
UHC 鷄憬熔鷄憬熔^ 11001101101011101100110011010101111010011100001011001101101011101100110011010101111010011100001001011110 cdaeccd5e9c2cdaeccd5e9c25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)