To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????}????????????????B 00111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 蛛イ豐サ}蛛イ閠ウ遶コ謐ィ遶コ謐ィ蛛イ辷セB 1110010110000001101100101110011010110010101110110111110111100101100000011011001011101000100000001011001111100111101010111011101011100110100011011010100011100111101010111011101011100110100011011010100011100101100000011011001011100111100010001011111001000010 e581b2e6b2bb7de581b2e880b3e7abbae68da8e7abbae68da8e581b2e788be42
EUC-JP 蛛イ豐サ}蛛イ閠ウ遶コ謐ィ遶コ謐ィ蛛イ辷セB 111010011110000110001110101100101110110010110100100011101011101101111101111010011110000110001110101100101110111111100000100011101011001111101110101011011000111010111010111010111110110110001110101010001110111010101101100011101011101011101011111011011000111010101000111010011110000110001110101100101110110111101000100011101011111001000010 e9e18eb2ecb48ebb7de9e18eb2efe08eb3eead8ebaebed8ea8eead8ebaebed8ea8e9e18eb2ede88ebe42
UTF-8 蛛イ豐サ}蛛イ閠ウ遶コ謐ィ遶コ謐ィ蛛イ辷セB 1110100010011011100110111110111110111101101100101110100010110001100100001110111110111101101110110111110111101000100110111001101111101111101111011011001011101001100101101010000011101111101111011011001111101001100000011011011011101111101111011011101011101000101011001001000011101111101111011010100011101001100000011011011011101111101111011011101011101000101011001001000011101111101111011010100011101000100110111001101111101111101111011011001011101000101111101011011111101111101111011011111001000010 e89b9befbdb2e8b190efbdbb7de89b9befbdb2e996a0efbdb3e981b6efbdbae8ac90efbda8e981b6efbdbae8ac90efbda8e89b9befbdb2e8beb7efbdbe42
UHC 蛛???}蛛?????謐???謐?蛛???B 111100011100100000111111001111110011111101111101111100011100100000111111001111110011111100111111001111111101101011001101001111110011111100111111110110101100110100111111111100011100100000111111001111110011111101000010 f1c83f3f3f7df1c83f3f3f3f3fdacd3f3f3fdacd3ff1c83f3f3f42

SJIS-win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
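
The conversion in the table can be roughly reproduced with a short Python sketch. The codec names below are assumptions: `cp932` stands in for SJIS-win and `cp949` for UHC, and Python's codecs may differ from this tool's converter for some edge-case characters. The helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the Python stand-in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as the Python stand-in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("}B").items():
        print(label, bits, hex_string)
```

ASCII characters such as `}` and `B` produce the same bytes (7d and 42) in all five charsets, which is why every row in the table contains `7d` and ends in `42`.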