To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???旭?????鹽???旭????????B 0011111100111111001111111000100010101110001111110011111100111111001111110011111111101010011001000011111100111111001111111000100010101110001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f88ae3f3f3f3f3fea643f3f3f88ae3f3f3f3f3f3f3f3f42
EUC-JP ???旭?????鹽???旭????????B 0011111100111111001111111011000010110000001111110011111100111111001111110011111111110011110001010011111100111111001111111011000010110000001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3fb0b03f3f3f3f3ff3c53f3f3fb0b03f3f3f3f3f3f3f3f42
UTF-8 쒀렲쒀旭렖롍렊렚쒔鹽쒀렲쒀旭렖롍렊렚쒔렊쒀롎B 11101100100100101000000011101011101000001011001011101100100100101000000011100110100101111010110111101011101000001001011011101011101000011000110111101011101000001000101011101011101000001001101011101100100100101001010011101001101110011011110111101100100100101000000011101011101000001011001011101100100100101000000011100110100101111010110111101011101000001001011011101011101000011000110111101011101000001000101011101011101000001001101011101100100100101001010011101011101000001000101011101100100100101000000011101011101000011000111001000010 ec9280eba0b2ec9280e697adeba096eba18deba08aeba09aec9294e9b9bdec9280eba0b2ec9280e697adeba096eba18deba08aeba09aec9294eba08aec9280eba18e42
UHC 쒀렲쒀旭렖롍렊렚쒔鹽쒀렲쒀旭렖롍렊렚쒔렊쒀롎B 101111101010110010001110101111111011111010101100111010011110111110001110101010111000111011010011100011101010000110001110101011011011111010101101111001111010010010111110101011001000111010111111101111101010110011101001111011111000111010101011100011101101001110001110101000011000111010101101101111101010110110001110101000011011111010101100100011101101010001000010 beac8ebfbeace9ef8eab8ed38ea18eadbeade7a4beac8ebfbeace9ef8eab8ed38ea18eadbead8ea1beac8ed442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)