Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セュ宍爾捨爾捨セュ宍爾捨治識セュ宍爾捨治 10111110101011011000111010110011100011101010001010001110110011001000111010100010100011101100110010111110101011011000111010110011100011101010001010001110110011001000111010100001100011101010111110111110101011011000111010110011100011101010001010001110110011001000111010100001 bead8eb38ea28ecc8ea28eccbead8eb38ea28ecc8ea18eafbead8eb38ea28ecc8ea1
EUC-JP セュ宍爾捨爾捨セュ宍爾捨治識セュ宍爾捨治 10001110101111101000111010101101101111001011010110111100101001001011110011001110101111001010010010111100110011101000111010111110100011101010110110111100101101011011110010100100101111001100111010111100101000111011110010110001100011101011111010001110101011011011110010110101101111001010010010111100110011101011110010100011 8ebe8eadbcb5bca4bccebca4bcce8ebe8eadbcb5bca4bccebca3bcb18ebe8eadbcb5bca4bccebca3
UTF-8 セュ宍爾捨爾捨セュ宍爾捨治識セュ宍爾捨治 111011111011110110111110111011111011110110101101111001011010111010001101111001111000100010111110111001101000110110101000111001111000100010111110111001101000110110101000111011111011110110111110111011111011110110101101111001011010111010001101111001111000100010111110111001101000110110101000111001101011001010111011111010001010110110011000111011111011110110111110111011111011110110101101111001011010111010001101111001111000100010111110111001101000110110101000111001101011001010111011 efbdbeefbdade5ae8de788bee68da8e788bee68da8efbdbeefbdade5ae8de788bee68da8e6b2bbe8ad98efbdbeefbdade5ae8de788bee68da8e6b2bb
UHC ???爾捨爾捨???爾捨治識???爾捨治 00111111001111110011111111101100101100111101111011010111111011001011001111011110110101110011111100111111001111111110110010110011110111101101011111110110101111011110001111011011001111110011111100111111111011001011001111011110110101111111011010111101 3f3f3fecb3ded7ecb3ded73f3f3fecb3ded7f6bde3db3f3f3fecb3ded7f6bd
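
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the function name to_bit_strings and the mapping from the page's charset labels to Python codec names (SJIS-WIN as cp932, EUC-JP as euc_jp, UHC as cp949) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the behavior visible in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings of text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "\uff7e"  # halfwidth katakana SE, the first character in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For the halfwidth katakana character U+FF7E, this yields 3f for ISO-8859-1, be for SJIS-WIN, 8ebe for EUC-JP, and efbdbe for UTF-8, which agrees with the leading bytes of the corresponding rows above.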

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).