To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????揄??荏???????揄??荏?B 001111110011111100111111001111110011111100111111100111011000100100111111001111111000100101100000001111110011111100111111001111110011111100111111001111111001110110001001001111110011111110001001011000000011111101000010 3f3f3f3f3f3f9d893f3f89603f3f3f3f3f3f3f9d893f3f89603f42
EUC-JP ???彛??揄??荏????彛??揄??荏?B 00111111001111110011111110001111101111001111101000111111001111111101100111101001001111110011111110110001110000010011111100111111001111110011111110001111101111001111101000111111001111111101100111101001001111110011111110110001110000010011111101000010 3f3f3f8fbcfa3f3fd9e93f3fb1c13f3f3f3f8fbcfa3f3fd9e93f3fb1c13f42
UTF-8 列룸씈彛쏁뙴揄쒕쓡荏춚列룸씈彛쏁뙴揄쒕쓡荏춚B 11101111101001101001110011101011101000111011100011101100100101001000100011100101101111011001101111101100100011111000000111101011100110011011010011100110100011111000010011101100100100101001010111101100100100111010000111101000100011011000111111101100101101101001101011101111101001101001110011101011101000111011100011101100100101001000100011100101101111011001101111101100100011111000000111101011100110011011010011100110100011111000010011101100100100101001010111101100100100111010000111101000100011011000111111101100101101101001101001000010 efa69ceba3b8ec9488e5bd9bec8f81eb99b4e68f84ec9295ec93a1e88d8fecb69aefa69ceba3b8ec9488e5bd9bec8f81eb99b4e68f84ec9295ec93a1e88d8fecb69a42
UHC 列룸씈彛쏁뙴揄쒕쓡荏춚列룸씈彛쏁뙴揄쒕쓡荏춚B 111001101110101010110111111010111001110110100000111011001010110110011011111001111000110010110111111010101111000110011100111010111001110110000010111011001111101110101101011101101110011011101010101101111110101110011101101000001110110010101101100110111110011110001100101101111110101011110001100111001110101110011101100000101110110011111011101011010111011001000010 e6eab7eb9da0ecad9be78cb7eaf19ceb9d82ecfbad76e6eab7eb9da0ecad9be78cb7eaf19ceb9d82ecfbad7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)