What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 繒祠???鞨??憶?繒祠???鞨??憶?^ 1111101110001111111000100100101100111111001111110011111111101000111000000011111100111111100010011010111100111111111110111000111111100010010010110011111100111111001111111110100011100000001111110011111110001001101011110011111101011110 fb8fe24b3f3f3fe8e03f3f89af3ffb8fe24b3f3f3fe8e03f3f89af3f5e
EUC-JP 繒祠???鞨??憶?繒祠???鞨??憶?^ 10001111110101001101010011100011101011000011111100111111001111111111000011100010001111110011111110110010101100010011111110001111110101001101010011100011101011000011111100111111001111111111000011100010001111110011111110110010101100010011111101011110 8fd4d4e3ac3f3f3ff0e23f3fb2b13f8fd4d4e3ac3f3f3ff0e23f3fb2b13f5e
UTF-8 繒祠렪렮罹鞨렲렭憶쌨繒祠렪렮罹鞨렲렭憶쌤^ 11100111101110011001001011100111101001011010000011101011101000001010101011101011101000001010111011101111101001111010011011101001100111101010100011101011101000001011001011101011101000001010110111100110100001101011011011101100100011001010100011100111101110011001001011100111101001011010000011101011101000001010101011101011101000001010111011101111101001111010011011101001100111101010100011101011101000001011001011101011101000001010110111100110100001101011011011101100100011001010010001011110 e7b992e7a5a0eba0aaeba0aeefa7a6e99ea8eba0b2eba0ade686b6ec8ca8e7b992e7a5a0eba0aaeba0aeefa7a6e99ea8eba0b2eba0ade686b6ec8ca45e
UHC 繒祠렪렮罹鞨렲렭憶쌨繒祠렪렮罹鞨렲렭憶쌤^ 1111000111111001110111101110011010001110101110001000111010111011111011001011101011001010111010101000111010111111100011101011101011100101111000111011110111011110111100011111100111011110111001101000111010111000100011101011101111101100101110101100101011101010100011101011111110001110101110101110010111100011101111011101110001011110 f1f9dee68eb88ebbecbacaea8ebf8ebae5e3bddef1f9dee68eb88ebbecbacaea8ebf8ebae5e3bddc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (3f) in that charset's row.
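
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-Win to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (3f), which is consistent with the 3f bytes in the rows above.

```python
# Charset label -> Python codec name. This mapping is an assumption of the
# sketch, not something the page itself documents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("^").items():
        print(label, bits, hexa)
```

For the input "^", every charset yields the same single byte (01011110, 5e), since all five encodings agree with ASCII in that range. This is why each row in the table ends in 5e.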