To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セュ社セュ胥自汐セュ軸セャ竺セュ自セュ社セュ 101111101010110110001110110100001011111010101101111000111110111110001110101010011000111010101100101111101010110110001110101100101011111010101100100011101011000110111110101011011000111010101001101111101010110110001110110100001011111010101101 bead8ed0beade3ef8ea98eacbead8eb2beac8eb1bead8ea9bead8ed0bead
EUC-JP セュ社セュ胥自汐セュ軸セャ竺セュ自セュ社セュ 1000111010111110100011101010110110111100110100101000111010111110100011101010110111100110111100011011110010101011101111001010111010001110101111101000111010101101101111001011010010001110101111101000111010101100101111001011001110001110101111101000111010101101101111001010101110001110101111101000111010101101101111001101001010001110101111101000111010101101 8ebe8eadbcd28ebe8eade6f1bcabbcae8ebe8eadbcb48ebe8eacbcb38ebe8eadbcab8ebe8eadbcd28ebe8ead
UTF-8 セュ社セュ胥自汐セュ軸セャ竺セュ自セュ社セュ 111011111011110110111110111011111011110110101101111001111010010010111110111011111011110110111110111011111011110110101101111010001000001110100101111010001000011110101010111001101011000110010000111011111011110110111110111011111011110110101101111010001011101110111000111011111011110110111110111011111011110110101100111001111010101110111010111011111011110110111110111011111011110110101101111010001000011110101010111011111011110110111110111011111011110110101101111001111010010010111110111011111011110110111110111011111011110110101101 efbdbeefbdade7a4beefbdbeefbdade883a5e887aae6b190efbdbeefbdade8bbb8efbdbeefbdace7abbaefbdbeefbdade887aaefbdbeefbdade7a4beefbdbeefbdad
UHC ??社??胥自汐??軸??竺??自??社?? 001111110011111111011110111001000011111100111111111000001010000111101101101110111110000010110001001111110011111111110101111011100011111100111111111101011110011100111111001111111110110110111011001111110011111111011110111001000011111100111111 3f3fdee43f3fe0a1edbbe0b13f3ff5ee3f3ff5e73f3fedbb3f3fdee43f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)