To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????j}v????????j}vB 0011111100111111001111110011111100111111001111110011111100111111011010100111110101110110001111110011111100111111001111110011111100111111001111110011111101101010011111010111011001000010 3f3f3f3f3f3f3f3f6a7d763f3f3f3f3f3f3f3f6a7d7642
SJIS-WIN 蛛イ閠祁蛛イ迺スj}v蛛イ閠祁蛛イ迺スj}vB 111001011000000110110010111010001000000010001100010101101110010110000001101100101110011110010010101111010110101001111101011101101110010110000001101100101110100010000000100011000101011011100101100000011011001011100111100100101011110101101010011111010111011001000010 e581b2e8808c56e581b2e792bd6a7d76e581b2e8808c56e581b2e792bd6a7d7642
EUC-JP 蛛イ閠祁蛛イ迺スj}v蛛イ閠祁蛛イ迺スj}vB 111010011110000110001110101100101110111111100000101101111011011111101001111000011000111010110010111011011111001010001110101111010110101001111101011101101110100111100001100011101011001011101111111000001011011110110111111010011110000110001110101100101110110111110010100011101011110101101010011111010111011001000010 e9e18eb2efe0b7b7e9e18eb2edf28ebd6a7d76e9e18eb2efe0b7b7e9e18eb2edf28ebd6a7d7642
UTF-8 蛛イ閠祁蛛イ迺スj}v蛛イ閠祁蛛イ迺スj}vB 11101000100110111001101111101111101111011011001011101001100101101010000011100111101001011000000111101000100110111001101111101111101111011011001011101000101111111011101011101111101111011011110101101010011111010111011011101000100110111001101111101111101111011011001011101001100101101010000011100111101001011000000111101000100110111001101111101111101111011011001011101000101111111011101011101111101111011011110101101010011111010111011001000010 e89b9befbdb2e996a0e7a581e89b9befbdb2e8bfbaefbdbd6a7d76e89b9befbdb2e996a0e7a581e89b9befbdb2e8bfbaefbdbd6a7d7642
UHC 蛛??祁蛛???j}v蛛??祁蛛???j}vB 1111000111001000001111110011111111010001101101011111000111001000001111110011111100111111011010100111110101110110111100011100100000111111001111111101000110110101111100011100100000111111001111110011111101101010011111010111011001000010 f1c83f3fd1b5f1c83f3f3f6a7d76f1c83f3fd1b5f1c83f3f3f6a7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)