To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????M????n}????M????n{^ 0011111100111111001111110011111101001101001111110011111100111111001111110110111001111101001111110011111100111111001111110100110100111111001111110011111100111111011011100111101101011110 3f3f3f3f4d3f3f3f3f6e7d3f3f3f3f4d3f3f3f3f6e7b5e
SJIS-WIN 魘」鮖ソM訒夊セ朷n}魘」鮖ソM訒夊セ朷n{^ 111010011011010010100011111010011011100110111111010011011111101110100011100110101110100010111110100111100101011001101110011111011110100110110100101000111110100110111001101111110100110111111011101000111001101011101000101111101001111001010110011011100111101101011110 e9b4a3e9b9bf4dfba39ae8be9e566e7de9b4a3e9b9bf4dfba39ae8be9e566e7b5e
EUC-JP 魘」鮖ソM訒夊セ朷n}魘」鮖ソM訒夊セ朷n{^ 1111001010110110100011101010001111110010101110111000111010111111010011011000111111011101110010001101010011101010100011101011111011011011101101110110111001111101111100101011011010001110101000111111001010111011100011101011111101001101100011111101110111001000110101001110101010001110101111101101101110110111011011100111101101011110 f2b68ea3f2bb8ebf4d8fddc8d4ea8ebedbb76e7df2b68ea3f2bb8ebf4d8fddc8d4ea8ebedbb76e7b5e
UTF-8 魘」鮖ソM訒夊セ朷n}魘」鮖ソM訒夊セ朷n{^ 11101001101011011001100011101111101111011010001111101001101011101001011011101111101111011011111101001101111010001010100010010010111001011010010010001010111011111011110110111110111001101001110010110111011011100111110111101001101011011001100011101111101111011010001111101001101011101001011011101111101111011011111101001101111010001010100010010010111001011010010010001010111011111011110110111110111001101001110010110111011011100111101101011110 e9ad98efbda3e9ae96efbdbf4de8a892e5a48aefbdbee69cb76e7de9ad98efbda3e9ae96efbdbf4de8a892e5a48aefbdbee69cb76e7b5e
UHC ????M????n}????M????n{^ 0011111100111111001111110011111101001101001111110011111100111111001111110110111001111101001111110011111100111111001111110100110100111111001111110011111100111111011011100111101101011110 3f3f3f3f4d3f3f3f3f6e7d3f3f3f3f4d3f3f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)