What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????????[??????????[^ | 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 | 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e |
| SJIS-WIN | 蘖????蘖????[蘖????蘖????[^ | 100111110101000000111111001111110011111100111111100111110101000000111111001111110011111100111111010110111001111101010000001111110011111100111111001111111001111101010000001111110011111100111111001111110101101101011110 | 9f503f3f3f3f9f503f3f3f3f5b9f503f3f3f3f9f503f3f3f3f5b5e |
| EUC-JP | 蘖????蘖????[蘖????蘖????[^ | 110111011011000100111111001111110011111100111111110111011011000100111111001111110011111100111111010110111101110110110001001111110011111100111111001111111101110110110001001111110011111100111111001111110101101101011110 | ddb13f3f3f3fddb13f3f3f3f5bddb13f3f3f3fddb13f3f3f3f5b5e |
| UTF-8 | 蘖붾튉烈쨥蘖붾튉烈쨥[蘖붾튉烈쨥蘖붾튉烈쨥[^ | 111010001001100010010110111010111011011010111110111011011000101010001001111011111010011010011111111011001010100010100101111010001001100010010110111010111011011010111110111011011000101010001001111011111010011010011111111011001010100010100101010110111110100010011000100101101110101110110110101111101110110110001010100010011110111110100110100111111110110010101000101001011110100010011000100101101110101110110110101111101110110110001010100010011110111110100110100111111110110010101000101001010101101101011110 | e89896ebb6beed8a89efa69feca8a5e89896ebb6beed8a89efa69feca8a55be89896ebb6beed8a89efa69feca8a5e89896ebb6beed8a89efa69feca8a55b5e |
| UHC | 蘖붾튉烈쨥蘖붾튉烈쨥[蘖붾튉烈쨥蘖붾튉烈쨥[^ | 11100101111011101001010011101011101110011001110111100110111011111010010001111010111001011110111010010100111010111011100110011101111001101110111110100100011110100101101111100101111011101001010011101011101110011001110111100110111011111010010001111010111001011110111010010100111010111011100110011101111001101110111110100100011110100101101101011110 | e5ee94ebb99de6efa47ae5ee94ebb99de6efa47a5be5ee94ebb99de6efa47ae5ee94ebb99de6efa47a5b5e |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
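
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The codec names `cp932`, `euc_jp` and `cp949` are assumed stand-ins for SJIS-WIN, EUC-JP and UHC, and `CHARSETS` and `encode_rows` are hypothetical names introduced for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the `?` runs in the table; individual mappings may still differ from the tool's for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_rows(text, charsets=CHARSETS):
    """Return (label, character, binary, hex) for each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Decode again to get the text as that charset would display it.
        shown = raw.decode(codec, errors="replace")
        # Eight bits per byte, zero-padded, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, bits, raw.hex()))
    return rows


# "\u8616" is the first character of the sample input in the table.
for label, shown, bits, hexed in encode_rows("\u8616[^"):
    print(label, bits, hexed)
```

The ASCII characters `[` and `^` encode to `5b` and `5e` in all five charsets, which is why every hexadecimal string in the table ends in `5b5e`.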