To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 茹??遊??幽??壓??瑜??柔レ?堊??? 111001001010010100111111001111111001011101010110001111110011111110010111010010000011111100111111100110101101100000111111001111111110000011101111001111110011111110001111010111111000001110001100001111111001101010111111001111110011111100111111 e4a53f3f97563f3f97483f3f9ad83f3fe0ef3f3f8f5f838c3f9abf3f3f3f
EUC-JP 茹??遊??幽??壓??瑜??柔レ?堊??? 111010001010011100111111001111111100110110110111001111110011111111001101101010010011111100111111110101001101101000111111001111111110000011110001001111110011111110111101110000001010010111101100001111111101010011000001001111110011111100111111 e8a73f3fcdb73f3fcda93f3fd4da3f3fe0f13f3fbdc0a5ec3fd4c13f3f3f
UTF-8 茹띿슜遊삡펺幽뚯춷壓믩갭瑜당춯柔レ돹堊묆렕流 111010001000110010111001111010111001110110111111111011001000101010011100111010011000000110001010111011001000001010100001111011011000111010111010111001011011100110111101111010111001101010101111111011001011011010110111111001011010001110010011111010111010111110101001111010101011000010101101111001111001000110011100111010111000101110111001111011001011011010101111111001101001111110010100111000111000001110101100111010111000111110111001111001011010000010001010111010111010110010000110111010111010000010010101111011111010011110001010 e88cb9eb9dbfec8a9ce9818aec82a1ed8ebae5b9bdeb9aafecb6b7e5a393ebafa9eab0ade7919ceb8bb9ecb6afe69f94e383aceb8fb9e5a08aebac86eba095efa78a
UHC 茹띿슜遊삡펺幽뚯춷壓믩갭瑜당춯柔レ돹堊묆렕流 1110011010101010100011011110110010011010101010011110101110110100101110111110010010111100100010101110101011101011100011001110110010101101100100111110010011100010100100101110101110110000101110001110101110100101101101001110011110101101100011001110101011110101101010111110110010001001101111001110010010111110100100011110001110001110101010101110101011111100 e6aa8dec9aa9ebb4bbe4bc8aeaeb8cecad93e4e292ebb0b8eba5b4e7ad8ceaf5abec89bce4be91e38eaaeafc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)