To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 源?障?悠???趙陌舞源?縡?愉???趙脈 1000110010111001001111111000111111100001001111111001011101001001001111110011111100111111111001101110001011101000100110011001010110010001100011001011100100111111111000110111000100111111100101101111100100111111001111110011111111100110111000101001011010101100 8cb93f8fe13f97493f3f3fe6e2e89995918cb93fe3713f96f93f3f3fe6e296ac
EUC-JP 源?障?悠???趙陌舞源?縡?愉???趙脈 1011100010111011001111111011111011100011001111111100110110101010001111110011111100111111111011001110010011101111111110011100100111110001101110001011101100111111111001011101001000111111110011001111101100111111001111110011111111101100111001001100110010101110 b8bb3fbee33fcdaa3f3f3fece4eff9c9f1b8bb3fe5d23fccfb3f3f3fece4ccae
UTF-8 源렰障렚悠꿸렕렟趙陌舞源렰縡렞愉브렕렟趙脈 111001101011101010010000111010111010000010110000111010011001101010011100111010111010000010011010111001101000001010100000111010101011111110111000111010111010000010010101111010111010000010011111111010001011011010011001111010011001100110001100111010001000100010011110111001101011101010010000111010111010000010110000111001111011100010100001111010111010000010011110111001101000010010001001111010111011100010001100111010111010000010010101111010111010000010011111111010001011011010011001111010001000010010001000 e6ba90eba0b0e99a9ceba09ae682a0eabfb8eba095eba09fe8b699e9998ce8889ee6ba90eba0b0e7b8a1eba09ee68489ebb88ceba095eba09fe8b699e88488
UHC 源렰障렚悠꿸렕렟趙陌舞源렰縡렞愉브렕렟趙脈 111010101011100110001110101111011110111010100001100011101010110111101010111011011011001011101010100011101010101010001110101100001111000011100001110110001110100011011001111100011110101010111001100011101011110111101110101011011000111010101111111010101111000010111010111010101000111010101010100011101011000011110000111000011101100011100110 eab98ebdeea18eadeaedb2ea8eaa8eb0f0e1d8e8d9f1eab98ebdeead8eafeaf0baea8eaa8eb0f0e1d8e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)