To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 闕蝉サカ隶占告莉カ蟾櫁告莉カ隶占告莉カ蟾曖 11101000100011011001000011100100101110111011011011101000101011101001000011101000100011011001000011100100101110111011011011100101101101111001111011101000100011011001000011100100101110111011011011101000101011101001000011101000100011011001000011100100101110111011011011100101101101111001111001000010 e88d90e4bbb6e8ae90e88d90e4bbb6e5b79ee88d90e4bbb6e8ae90e88d90e4bbb6e5b79e42
EUC-JP 闕蝉サカ隶占告莉カ蟾櫁告莉カ隶占告莉カ蟾曖 111011111110110111000000111001101000111010111011100011101011011011110000101100001100000011101010101110011111000011101000101111011000111010110110111010101011100111011100111010101011100111110000111010001011110110001110101101101111000010110000110000001110101010111001111100001110100010111101100011101011011011101010101110011101101110100011 efedc0e68ebb8eb6f0b0c0eab9f0e8bd8eb6eab9dceab9f0e8bd8eb6f0b0c0eab9f0e8bd8eb6eab9dba3
UTF-8 闕蝉サカ隶占告莉カ蟾櫁告莉カ隶占告莉カ蟾曖 111010011001011110010101111010001001110110001001111011111011110110111011111011111011110110110110111010011001101010110110111001011000110110100000111001011001000110001010111010001000111010001001111011111011110110110110111010001001111110111110111001101010101110000001111001011001000110001010111010001000111010001001111011111011110110110110111010011001101010110110111001011000110110100000111001011001000110001010111010001000111010001001111011111011110110110110111010001001111110111110111001101001101110010110 e99795e89d89efbdbbefbdb6e99ab6e58da0e5918ae88e89efbdb6e89fbee6ab81e5918ae88e89efbdb6e99ab6e58da0e5918ae88e89efbdb6e89fbee69b96
UHC 闕????占告莉?蟾?告莉??占告莉?蟾曖 110011111111010000111111001111110011111100111111111011111011111111001101101100011101011111101001001111111110000011101010001111111100110110110001110101111110100100111111001111111110111110111111110011011011000111010111111010010011111111100000111010101110010011110010 cff43f3f3f3fefbfcdb1d7e93fe0ea3fcdb1d7e93f3fefbfcdb1d7e93fe0eae4f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)