To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 霑ェ逕壽掠霑ェ逕壽掠[霑ェ逕壽掠霑ェ逕壽掠[^ 111010001011111110101010111001111001010010011010111001101001011110101001111010001011111110101010111001111001010010011010111001101001011110101001010110111110100010111111101010101110011110010100100110101110011010010111101010011110100010111111101010101110011110010100100110101110011010010111101010010101101101011110 e8bfaae7949ae697a9e8bfaae7949ae697a95be8bfaae7949ae697a9e8bfaae7949ae697a95b5e
EUC-JP 霑ェ逕壽掠霑ェ逕壽掠[霑ェ逕壽掠霑ェ逕壽掠[^ 11110000110000011000111010101010111011011111010011010100111010001100111010101011111100001100000110001110101010101110110111110100110101001110100011001110101010110101101111110000110000011000111010101010111011011111010011010100111010001100111010101011111100001100000110001110101010101110110111110100110101001110100011001110101010110101101101011110 f0c18eaaedf4d4e8ceabf0c18eaaedf4d4e8ceab5bf0c18eaaedf4d4e8ceabf0c18eaaedf4d4e8ceab5b5e
UTF-8 霑ェ逕壽掠霑ェ逕壽掠[霑ェ逕壽掠霑ェ逕壽掠[^ 111010011001110010010001111011111011110110101010111010011000000010010101111001011010001110111101111001101000111010100000111010011001110010010001111011111011110110101010111010011000000010010101111001011010001110111101111001101000111010100000010110111110100110011100100100011110111110111101101010101110100110000000100101011110010110100011101111011110011010001110101000001110100110011100100100011110111110111101101010101110100110000000100101011110010110100011101111011110011010001110101000000101101101011110 e99c91efbdaae98095e5a3bde68ea0e99c91efbdaae98095e5a3bde68ea05be99c91efbdaae98095e5a3bde68ea0e99c91efbdaae98095e5a3bde68ea05b5e
UHC 霑?逕壽掠霑?逕壽掠[霑?逕壽掠霑?逕壽掠[^ 111011111100010100111111110011001110111111100001111110001101010111010011111011111100010100111111110011001110111111100001111110001101010111010011010110111110111111000101001111111100110011101111111000011111100011010101110100111110111111000101001111111100110011101111111000011111100011010101110100110101101101011110 efc53fccefe1f8d5d3efc53fccefe1f8d5d35befc53fccefe1f8d5d3efc53fccefe1f8d5d35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)