To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN セャ邪樟冐ソセォ鴫セ、偲セュ軸セォ赦フB 101111101010110010001110110101111000111110111110111000111110110010111111101111101010101110001110101100001011111010100100100011101100001110111110101011011000111010110010101111101010101110001110110011011100110001000010 beac8ed78fbee3ecbfbeab8eb0bea48ec3bead8eb2beab8ecdcc42
EUC-JP セャ邪樟冐ソセォ鴫セ、偲セュ軸セォ赦フB 100011101011111010001110101011001011110011011001101111101100000011100110111011101000111010111111100011101011111010001110101010111011110010110010100011101011111010001110101001001011110011000101100011101011111010001110101011011011110010110100100011101011111010001110101010111011110011001111100011101100110001000010 8ebe8eacbcd9bec0e6ee8ebf8ebe8eabbcb28ebe8ea4bcc58ebe8eadbcb48ebe8eabbccf8ecc42
UTF-8 セャ邪樟冐ソセォ鴫セ、偲セュ軸セォ赦フB 11101111101111011011111011101111101111011010110011101001100000101010101011100110101010001001111111100101100001101001000011101111101111011011111111101111101111011011111011101111101111011010101111101001101101001010101111101111101111011011111011101111101111011010010011100101100000011011001011101111101111011011111011101111101111011010110111101000101110111011100011101111101111011011111011101111101111011010101111101000101101011010011011101111101111101000110001000010 efbdbeefbdace982aae6a89fe58690efbdbfefbdbeefbdabe9b4abefbdbeefbda4e581b2efbdbeefbdade8bbb8efbdbeefbdabe8b5a6efbe8c42
UHC ??邪樟??????????軸??赦?B 001111110011111111011110111101111110110111101001001111110011111100111111001111110011111100111111001111110011111100111111001111111111010111101110001111110011111111011110111101010011111101000010 3f3fdef7ede93f3f3f3f3f3f3f3f3f3ff5ee3f3fdef53f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)