To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????雅??源??油??闇λ?竊 00111111001111110011111100111111001111110011111100111111001111110011111110001001111010110011111100111111100011001011100100111111001111111001011011111011001111110011111110001000110001011000001111001001001111111110001010000110 3f3f3f3f3f3f3f3f3f89eb3f3f8cb93f3f96fb3f3f88c583c93fe286
EUC-JP ?????????雅??源??油??闇λ?竊 00111111001111110011111100111111001111110011111100111111001111110011111110110010111011010011111100111111101110001011101100111111001111111100110011111101001111110011111110110000110001111010011011001011001111111110001111100110 3f3f3f3f3f3f3f3f3fb2ed3f3fb8bb3f3fccfd3f3fb0c7a6cb3fe3e6
UTF-8 列룸쑜理덃걗栒삼폇雅뚮떞源당뙴油몃쭖闇λ슢竊 1110111110100110100111001110101110100011101110001110110010010001100111001110111110100111101001001110101110001101100000111110101010110001100101111110011010100000100100101110110010000010101111001110110110001111100001111110100110011011100001011110101110011010101011101110101110010110100111101110011010111010100100001110101110001011101110011110101110011001101101001110011010110010101110011110101110101010100000111110110010101101100101101110100110010111100001111100111010111011111011001000101010100010111001111010101110001010 efa69ceba3b8ec919cefa7a4eb8d83eab197e6a092ec82bced8f87e99b85eb9aaeeb969ee6ba90eb8bb9eb99b4e6b2b9ebaa83ecad96e99787cebbec8aa2e7ab8a
UHC 列룸쑜理덃걗栒삼폇雅뚮떞源당뙴油몃쭖闇λ슢竊 1110011011101010101101111110101110011100101110111110110010110101100010001110011010000001100000101110001011100011101110111110111110111100100101001110010010111010100011001110101110001011101101001110101010111001101101001110011110001100101101111110101011111010101110001110101110100111100011101110010011100001101001011110101110011010101011101110111110111100 e6eab7eb9cbbecb588e68182e2e3bbefbc94e4ba8ceb8bb4eab9b4e78cb7eafab8eba78ee4e1a5eb9aaeefbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)