To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 諺??猷??矣?┯嚴щ?悠??循?㎜? 10001100101111110011111100111111100101110101000100111111001111111110000111100001001111111000010010110110100110101000111010000100100010110011111110010111010010010011111100111111100011110111101000111111100001110110111100111111 8cbf3f3f97513f3fe1e13f84b69a8e848b3f97493f3f8f7a3f876f3f
EUC-JP 諺??猷??矣?┯嚴щ?悠??循??? 101110001100000100111111001111111100110110110010001111110011111111100010111000110011111110101000101110001101001111101110101001111110101100111111110011011010101000111111001111111011110111011011001111110011111100111111 b8c13f3fcdb23f3fe2e33fa8b8d3eea7eb3fcdaa3f3fbddb3f3f3f
UTF-8 諺⑸쉼猷딀룚矣섎┯嚴щ벊悠뽳쫮循뗫㎜力 1110100010101011101110101110001010010001101110001110110010001001101111001110011110001100101101111110101110010100100000001110101110100011100110101110011110011111101000111110110010000100100011101110001010010100101011111110010110011010101101001101000110001001111010111011001010001010111001101000001010100000111010111011110110110011111011001010101110101110111001011011111010101010111010111001011110101011111000111000111010011100111011111010011010001010 e8abbae291b8ec89bce78cb7eb9480eba39ae79fa3ec848ee294afe59ab4d189ebb28ae682a0ebbdb3ecabaee5beaaeb97abe38e9cefa68a
UHC 諺⑸쉼猷딀룚矣섎┯嚴щ벊悠뽳쫮循뗫㎜力 1110010111101100101010011110101110111101101100001110101110100011100010101110011010001111100101101110101111111000100110001110101110100110101110001110010111110001101011001110101110010011101011011110101011101101100101101110111110100110100001101110001011100000100010111110101110100111101011101110011010110011 e5eca9ebbdb0eba38ae68f96ebf898eba6b8e5f1aceb93adeaed96efa686e2e08beba7aee6b3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
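
The same kind of conversion can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `bit_strings` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the 3f bytes in the table are.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("あ").items():
        print(label, binary, hexadecimal)
```

For the single character "あ", this gives e38182 in UTF-8, 82a0 in SJIS-WIN, a4a2 in EUC-JP, and 3f in ISO-8859-1, since Latin-1 has no hiragana.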