To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セュ竺セ、鉦ヒ爾礁ュ竺セ、鉦ヒ爾礁 1011111010101101100011101011000110111110101001001000111111011110110010111000111010100010100011111100101011110001101111101010110110001110101100011011111010100100100011111101111011001011100011101010001010001111110010101111000101000010 bead8eb1bea48fdecb8ea28fcaf1bead8eb1bea48fdecb8ea28fcaf142
EUC-JP セュ竺セ、鉦ヒ爾礁?ュ竺セ、鉦ヒ爾礁? 100011101011111010001110101011011011110010110011100011101011111010001110101001001011111011100000100011101100101110111100101001001011111011001100001111111000111010101101101111001011001110001110101111101000111010100100101111101110000010001110110010111011110010100100101111101100110000111111 8ebe8eadbcb38ebe8ea4bee08ecbbca4becc3f8eadbcb38ebe8ea4bee08ecbbca4becc3f
UTF-8 セュ竺セ、鉦ヒ爾礁ュ竺セ、鉦ヒ爾礁 111011111011110110111110111011111011110110101101111001111010101110111010111011111011110110111110111011111011110110100100111010011000100110100110111011111011111010001011111001111000100010111110111001111010010010000001111011101000010010111001111011111011110110101101111001111010101110111010111011111011110110111110111011111011110110100100111010011000100110100110111011111011111010001011111001111000100010111110111001111010010010000001111011101000001010111110 efbdbeefbdade7abbaefbdbeefbda4e989a6efbe8be788bee7a481ee84b9efbdade7abbaefbdbeefbda4e989a6efbe8be788bee7a481ee82be
UHC ??竺??鉦?爾礁??竺??鉦?爾礁? 001111110011111111110101111001110011111100111111111011111111101000111111111011001011001111110101101001110011111100111111111101011110011100111111001111111110111111111010001111111110110010110011111101011010011100111111 3f3ff5e73f3feffa3fecb3f5a73f3ff5e73f3feffa3fecb3f5a73f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
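
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あ").items():
        print(label, bits, hexstr)
```

Because each byte is rendered as exactly eight bits, the binary column is always four times as long as the hexadecimal column, which gives a quick check when comparing rows by eye.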