To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?淨?鬱屯??趙貊?賊?障?除???趙貊 1110001101110001001111111001111111000100001111111001111101010100100100111101010000111111001111111110011011100010111001101011101100111111100100011010111100111111100011111110000100111111100011111001110000111111001111110011111111100110111000101110011010111011 e3713f9fc43f9f5493d43f3fe6e2e6bb3f91af3f8fe13f8f9c3f3f3fe6e2e6bb
EUC-JP 縡?淨?鬱屯??趙貊?賊?障?除???趙貊 1110010111010010001111111101111011000110001111111101110110110101110001101101011000111111001111111110110011100100111011001011110100111111110000101011000100111111101111101110001100111111101111011111110000111111001111110011111111101100111001001110110010111101 e5d23fdec63fddb5c6d63f3fece4ecbd3fc2b13fbee33fbdfc3f3f3fece4ecbd
UTF-8 縡렕淨렠鬱屯렕렟趙貊뱌賊렠障렚除곌렕렟趙貊 111001111011100010100001111010111010000010010101111001101011011110101000111010111010000010100000111010011010110010110001111001011011000110101111111010111010000010010101111010111010000010011111111010001011011010011001111010001011001010001010111010111011000110001100111010001011001110001010111010111010000010100000111010011001101010011100111010111010000010011010111010011001100110100100111010101011001110001100111010111010000010010101111010111010000010011111111010001011011010011001111010001011001010001010 e7b8a1eba095e6b7a8eba0a0e9acb1e5b1afeba095eba09fe8b699e8b28aebb18ce8b38aeba0a0e99a9ceba09ae999a4eab38ceba095eba09fe8b699e8b28a
UHC 縡렕淨렠鬱屯렕렟趙貊뱌賊렠障렚除곌렕렟趙貊 111011101010110110001110101010101110111111100100100011101011000111101010101001101101010011101010100011101010101010001110101100001111000011100001110110001110011110111001111100101110111011100100100011101011000111101110101000011000111010101101111100001011011010110000111010101000111010101010100011101011000011110000111000011101100011100111 eead8eaaefe48eb1eaa6d4ea8eaa8eb0f0e1d8e7b9f2eee48eb1eea18eadf0b6b0ea8eaa8eb0f0e1d8e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)