To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 澳??撓θす受↑?扼??節??沃??穩??^ 1110000001010011001111110011111110011101100110101000001111000110100000101011011110001110111100111000000110101010001111111001110101001110001111110011111110010000110111110011111100111111100101111000000000111111001111111110001001110010001111110011111101011110 e0533f3f9d9a83c682b78ef381aa3f9d4e3f3f90df3f3f97803f3fe2723f3f5e
EUC-JP 澳??撓θす受↑?扼??節??沃??穩??^ 1101111110110100001111110011111111011001111110101010011011001000101001001011100110111100111101011010001010101100001111111101100110101111001111110011111111000000111000010011111100111111110011011110000000111111001111111110001111010011001111110011111101011110 dfb43f3fd9faa6c8a4b9bcf5a2ac3fd9af3f3fc0e13f3fcde03f3fe3d33f3f5e
UTF-8 澳뉛슐撓θす受↑콒扼녘젃節띌뿥沃밟뼞穩롩젅^ 111001101011111010110011111010111000100110011011111011001000101010010000111001101001001010010011110011101011100011100011100000011001100111100101100011111001011111100010100001101001000111101100101111011001001011100110100010011011110011101011100001011001100011101100101000001000001111100111101011111000000011101011100111011000110011101011101111111010010111100110101100101000001111101011101100001001111111101011101111001001111011100111101010011010100111101011101000011010100111101100101000001000010101011110 e6beb3eb899bec8a90e69293ceb8e38199e58f97e28691ecbd92e689bceb8598eca083e7af80eb9d8cebbfa5e6b283ebb09febbc9ee7a9a9eba1a9eca0855e
UHC 澳뉛슐撓θす受↑콒扼녘젃節띌뿥沃밟뼞穩롩젅^ 11100111111111101000011111101111101111011011011011101000111101011010010111101000101010101011100111100001111101001010000111101000101100011000111011100100111110011011001111101000101000001000011111101111101111011011011011101001100101111010010111101000101010101011100111100010100101101010000111101000101100011000111011101001101000001000100001011110 e7fe87efbdb6e8f5a5e8aab9e1f4a1e8b18ee4f9b3e8a087efbdb6e997a5e8aab9e296a1e8b18ee9a0885e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)