To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 僥??撓???ヨ?絶?ぜ節??撓???ヮ?? 100110010100011000111111001111111001110110011010001111110011111100111111100000111000100000111111100100001110001000111111100000101011101010010000110111110011111100111111100111011001101000111111001111110011111110000011100011100011111100111111 99463f3f9d9a3f3f3f83883f90e23f82ba90df3f3f9d9a3f3f3f838e3f3f
EUC-JP 僥??撓???ヨ?絶?ぜ節??撓???ヮ?? 110100011010011100111111001111111101100111111010001111110011111100111111101001011110100000111111110000001110010000111111101001001011110011000000111000010011111100111111110110011111101000111111001111110011111110100101111011100011111100111111 d1a73f3fd9fa3f3f3fa5e83fc0e43fa4bcc0e13f3fd9fa3f3f3fa5ee3f3f
UTF-8 僥울풐撓뷂쉼樂ヨ짅絶귡ぜ節억풐撓뷂쉼樂ヮ뀾連 111001011000001110100101111011001001101010111000111011011001001010010000111001101001001010010011111010111011011110000010111011001000100110111100111011111010011010111111111000111000001110101000111011001010011110000101111001111011010110110110111010101011011110100001111000111000000110011100111001111010111110000000111011001001011010110101111011011001001010010000111001101001001010010011111010111011011110000010111011001000100110111100111011111010011010111111111000111000001110101110111010111000000010111110111011111010011010011010 e583a5ec9ab8ed9290e69293ebb782ec89bcefa6bfe383a8eca785e7b5b6eab7a1e3819ce7af80ec96b5ed9290e69293ebb782ec89bcefa6bfe383aeeb80beefa69a
UHC 僥울풐撓뷂쉼樂ヨ짅絶귡ぜ節억풐撓뷂쉼樂ヮ뀾連 1110100011101001101111111110111110111110100101001110100011110101100101001110111110111101101100001110100011111001101010111110100010100011100101001110111110111110100000101110100110101010101111001110111110111101101111101110111110111110100101001110100011110101100101001110111110111101101100001110100011111001101010111110111010000101101101001110011011100110 e8e9bfefbe94e8f594efbdb0e8f9abe8a394efbe82e9aabcefbdbeefbe94e8f594efbdb0e8f9abee85b4e6e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)