To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???旭???????????旭?????鹽^ 0011111100111111001111111000100010101110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110001000101011100011111100111111001111110011111100111111111010100110010001011110 3f3f3f88ae3f3f3f3f3f3f3f3f3f3f3f88ae3f3f3f3f3fea645e
EUC-JP ???旭???????????旭?????鹽^ 0011111100111111001111111011000010110000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110110000101100000011111100111111001111110011111100111111111100111100010101011110 3f3f3fb0b03f3f3f3f3f3f3f3f3f3f3fb0b03f3f3f3f3ff3c55e
UTF-8 쒀렲쒀旭렖롍렊렚쒔롍쒀렺쒀렲쒀旭렖롍렊렚쒔鹽^ 11101100100100101000000011101011101000001011001011101100100100101000000011100110100101111010110111101011101000001001011011101011101000011000110111101011101000001000101011101011101000001001101011101100100100101001010011101011101000011000110111101100100100101000000011101011101000001011101011101100100100101000000011101011101000001011001011101100100100101000000011100110100101111010110111101011101000001001011011101011101000011000110111101011101000001000101011101011101000001001101011101100100100101001010011101001101110011011110101011110 ec9280eba0b2ec9280e697adeba096eba18deba08aeba09aec9294eba18dec9280eba0baec9280eba0b2ec9280e697adeba096eba18deba08aeba09aec9294e9b9bd5e
UHC 쒀렲쒀旭렖롍렊렚쒔롍쒀렺쒀렲쒀旭렖롍렊렚쒔鹽^ 101111101010110010001110101111111011111010101100111010011110111110001110101010111000111011010011100011101010000110001110101011011011111010101101100011101101001110111110101011001000111011000010101111101010110010001110101111111011111010101100111010011110111110001110101010111000111011010011100011101010000110001110101011011011111010101101111001111010010001011110 beac8ebfbeace9ef8eab8ed38ea18eadbead8ed3beac8ec2beac8ebfbeace9ef8eab8ed38ea18eadbeade7a45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)