To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 霑ェ逕壼霑ェ踉壽掠[霑ェ逕壼霑ェ踉壽掠[^ 111010001011111110101010111001111001010010011010111001011111010110101001111010001011111110101010111001101111001010011010111001101001011110101001010110111110100010111111101010101110011110010100100110101110010111110101101010011110100010111111101010101110011011110010100110101110011010010111101010010101101101011110 e8bfaae7949ae5f5a9e8bfaae6f29ae697a95be8bfaae7949ae5f5a9e8bfaae6f29ae697a95b5e
EUC-JP 霑ェ逕壼?霑ェ踉壽掠[霑ェ逕壼?霑ェ踉壽掠[^ 1111000011000001100011101010101011101101111101001101010011100111001111111111000011000001100011101010101011101100111101001101010011101000110011101010101101011011111100001100000110001110101010101110110111110100110101001110011100111111111100001100000110001110101010101110110011110100110101001110100011001110101010110101101101011110 f0c18eaaedf4d4e73ff0c18eaaecf4d4e8ceab5bf0c18eaaedf4d4e73ff0c18eaaecf4d4e8ceab5b5e
UTF-8 霑ェ逕壼霑ェ踉壽掠[霑ェ逕壼霑ェ踉壽掠[^ 111010011001110010010001111011111011110110101010111010011000000010010101111001011010001110111100111011101001000010010100111010011001110010010001111011111011110110101010111010001011100010001001111001011010001110111101111001101000111010100000010110111110100110011100100100011110111110111101101010101110100110000000100101011110010110100011101111001110111010010000100101001110100110011100100100011110111110111101101010101110100010111000100010011110010110100011101111011110011010001110101000000101101101011110 e99c91efbdaae98095e5a3bcee9094e99c91efbdaae8b889e5a3bde68ea05be99c91efbdaae98095e5a3bcee9094e99c91efbdaae8b889e5a3bde68ea05b5e
UHC 霑?逕??霑??壽掠[霑?逕??霑??壽掠[^ 111011111100010100111111110011001110111100111111001111111110111111000101001111110011111111100001111110001101010111010011010110111110111111000101001111111100110011101111001111110011111111101111110001010011111100111111111000011111100011010101110100110101101101011110 efc53fccef3f3fefc53f3fe1f8d5d35befc53fccef3f3fefc53f3fe1f8d5d35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)