To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蝴???驀????硫蝴???驀????硫^ 111001011001101000111111001111110011111111101001011111010011111100111111001111110011111110010111101100001110010110011010001111110011111100111111111010010111110100111111001111110011111100111111100101111011000001011110 e59a3f3f3fe97d3f3f3f3f97b0e59a3f3f3fe97d3f3f3f3f97b05e
EUC-JP 蝴???驀????硫蝴???驀????硫^ 111010011111101000111111001111110011111111110001110111100011111100111111001111110011111111001110101100101110100111111010001111110011111100111111111100011101111000111111001111110011111100111111110011101011001001011110 e9fa3f3f3ff1de3f3f3f3fceb2e9fa3f3f3ff1de3f3f3f3fceb25e
UTF-8 蝴렫롋꾀驀렲렫롋꾀硫蝴렫롋꾀驀렲렫롋꾀硫^ 11101000100111011011010011101011101000001010101111101011101000011000101111101010101111101000000011101001101010011000000011101011101000001011001011101011101000001010101111101011101000011000101111101010101111101000000011100111101000011010101111101000100111011011010011101011101000001010101111101011101000011000101111101010101111101000000011101001101010011000000011101011101000001011001011101011101000001010101111101011101000011000101111101010101111101000000011100111101000011010101101011110 e89db4eba0abeba18beabe80e9a980eba0b2eba0abeba18beabe80e7a1abe89db4eba0abeba18beabe80e9a980eba0b2eba0abeba18beabe80e7a1ab5e
UHC 蝴렫롋꾀驀렲렫롋꾀硫蝴렫롋꾀驀렲렫롋꾀硫^ 1111101111011101100011101011100110001110110100011011001011010010110110001110100110001110101111111000111010111001100011101101000110110010110100101101011110111100111110111101110110001110101110011000111011010001101100101101001011011000111010011000111010111111100011101011100110001110110100011011001011010010110101111011110001011110 fbdd8eb98ed1b2d2d8e98ebf8eb98ed1b2d2d7bcfbdd8eb98ed1b2d2d8e98ebf8eb98ed1b2d2d7bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)