To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 薔????????莎?膺?茨?夭??垈^ 1110010101001011001111110011111100111111001111110011111100111111001111110011111111100100101100110011111111100100010111100011111110001000111011110011111110011010111011100011111100111111100110101011000001011110 e54b3f3f3f3f3f3f3f3fe4b33fe45e3f88ef3f9aee3f3f9ab05e
EUC-JP 薔????玎???莎馝膺?茨?夭??垈^ 111010011010110000111111001111110011111100111111100011111100101111010010001111110011111100111111111010001011010110001111111010001111100011100111101111110011111110110000111100010011111111010100111100000011111100111111110101001011001001011110 e9ac3f3f3f3f8fcbd23f3f3fe8b58fe8f8e7bf3fb0f13fd4f03f3fd4b25e
UTF-8 薔몄렮댄셉玎렎뤉롭莎馝膺뺨茨즉夭쨵뭍垈^ 11101000100101101001010011101011101010101000010011101011101000001010111011101011100011001000010011101100100001011000100111100111100011101000111011101011101000001000111011101011101001001000100111101011101000011010110111101000100011101000111011101001101001101001110111101000100001101011101011101011101110101010100011101000100011001010100011101100101001101000100111100101101001001010110111101100101010001011010111101011101011011000110111100101100111101000100001011110 e89694ebaa84eba0aeeb8c84ec8589e78e8eeba08eeba489eba1ade88e8ee9a69de886baebbaa8e88ca8eca689e5a4adeca8b5ebad8de59e885e
UHC 薔몄렮댄셉玎렎뤉롭莎馝膺뺨茨즉夭쨵뭍垈^ 111011011111100110111000111011001000111010111011101101001110110110111100110000011110111111101001100011101010010010001111101110011011011111010011110111101110110111111001101110001110101111101100101110111011010011101101101111001100000111101111111010001110110010100100100011111011100110110111110100111101110001011110 edf9b8ec8ebbb4edbcc1efe98ea48fb9b7d3deedf9b8ebecbbb4edbcc1efe8eca48fb9b7d3dc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
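
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper name `encode_row` are assumptions made for illustration. Characters that have no mapping in the target charset are replaced with `?` (0x3f), which appears to be what the `3f` bytes in the table represent.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's closest match
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns every unencodable character into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = encode_row("薔^", codec)
        print(label, bits, hex_string)
```

For the two-character input `薔^`, the UTF-8 row comes out as `e896945e`, which agrees with the first and last bytes of the UTF-8 row in the table above.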