Which bit string does a character (or a short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN セュ爾治識セュ治セュ謝セュ爾治識セュ治セュ謝^ 101111101010110110001110101000101000111010100001100011101010111110111110101011011000111010100001101111101010110110001110110100111011111010101101100011101010001010001110101000011000111010101111101111101010110110001110101000011011111010101101100011101101001101011110 bead8ea28ea18eafbead8ea1bead8ed3bead8ea28ea18eafbead8ea1bead8ed35e
EUC-JP セュ爾治識セュ治セュ謝セュ爾治識セュ治セュ謝^ 100011101011111010001110101011011011110010100100101111001010001110111100101100011000111010111110100011101010110110111100101000111000111010111110100011101010110110111100110101011000111010111110100011101010110110111100101001001011110010100011101111001011000110001110101111101000111010101101101111001010001110001110101111101000111010101101101111001101010101011110 8ebe8eadbca4bca3bcb18ebe8eadbca38ebe8eadbcd58ebe8eadbca4bca3bcb18ebe8eadbca38ebe8eadbcd55e
UTF-8 セュ爾治識セュ治セュ謝セュ爾治識セュ治セュ謝^ 11101111101111011011111011101111101111011010110111100111100010001011111011100110101100101011101111101000101011011001100011101111101111011011111011101111101111011010110111100110101100101011101111101111101111011011111011101111101111011010110111101000101011001001110111101111101111011011111011101111101111011010110111100111100010001011111011100110101100101011101111101000101011011001100011101111101111011011111011101111101111011010110111100110101100101011101111101111101111011011111011101111101111011010110111101000101011001001110101011110 efbdbeefbdade788bee6b2bbe8ad98efbdbeefbdade6b2bbefbdbeefbdade8ac9defbdbeefbdade788bee6b2bbe8ad98efbdbeefbdade6b2bbefbdbeefbdade8ac9d5e
UHC ??爾治識??治??謝??爾治識??治??謝^ 001111110011111111101100101100111111011010111101111000111101101100111111001111111111011010111101001111110011111111011110111100110011111100111111111011001011001111110110101111011110001111011011001111110011111111110110101111010011111100111111110111101111001101011110 3f3fecb3f6bde3db3f3ff6bd3f3fdef33f3fecb3f6bde3db3f3ff6bd3f3fdef35e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
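
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation (which is not shown here). The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) and the helper names encode_row and encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the 3f bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hex string.
    for label, (bits, hexa) in encode_table("治^").items():
        print(label, bits, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column, as in the table above.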