To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????oBF 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6f4246
SJIS-WIN セォ鴫治釈セュ痔セャ社セォ鴫セュ竺セュ実セャ軸oBF 1011111010101011100011101011000010001110101000011000111011011111101111101010110110001110101001001011111010101100100011101101000010111110101010111000111010110000101111101010110110001110101100011011111010101101100011101100000010111110101011001000111010110010011011110100001001000110 beab8eb08ea18edfbead8ea4beac8ed0beab8eb0bead8eb1bead8ec0beac8eb26f4246
EUC-JP セォ鴫治釈セュ痔セャ社セォ鴫セュ竺セュ実セャ軸oBF 10001110101111101000111010101011101111001011001010111100101000111011110011100001100011101011111010001110101011011011110010100110100011101011111010001110101011001011110011010010100011101011111010001110101010111011110010110010100011101011111010001110101011011011110010110011100011101011111010001110101011011011110011000010100011101011111010001110101011001011110010110100011011110100001001000110 8ebe8eabbcb2bca3bce18ebe8eadbca68ebe8eacbcd28ebe8eabbcb28ebe8eadbcb38ebe8eadbcc28ebe8eacbcb46f4246
UTF-8 セォ鴫治釈セュ痔セャ社セォ鴫セュ竺セュ実セャ軸oBF 111011111011110110111110111011111011110110101011111010011011010010101011111001101011001010111011111010011000011110001000111011111011110110111110111011111011110110101101111001111001011110010100111011111011110110111110111011111011110110101100111001111010010010111110111011111011110110111110111011111011110110101011111010011011010010101011111011111011110110111110111011111011110110101101111001111010101110111010111011111011110110111110111011111011110110101101111001011010111010011111111011111011110110111110111011111011110110101100111010001011101110111000011011110100001001000110 efbdbeefbdabe9b4abe6b2bbe98788efbdbeefbdade79794efbdbeefbdace7a4beefbdbeefbdabe9b4abefbdbeefbdade7abbaefbdbeefbdade5ae9fefbdbeefbdace8bbb86f4246
UHC ???治???痔??社?????竺?????軸oBF 00111111001111110011111111110110101111010011111100111111001111111111011011000000001111110011111111011110111001000011111100111111001111110011111100111111111101011110011100111111001111110011111100111111001111111111010111101110011011110100001001000110 3f3f3ff6bd3f3f3ff6c03f3fdee43f3f3f3f3ff5e73f3f3f3f3ff5ee6f4246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)