To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ?莎?舍??酊??n}?莎?舍??酊??n{^ 0011111111100100101100110011111111100100011100010011111100111111111001111100001000111111001111110110111001111101001111111110010010110011001111111110010001110001001111110011111111100111110000100011111100111111011011100111101101011110 3fe4b33fe4713f3fe7c23f3f6e7d3fe4b33fe4713f3fe7c23f3f6e7b5e
EUC-JP 蔣莎?舍??酊??n}蔣莎?舍??酊??n{^ 100011111101100110110110111010001011010100111111111001111101001000111111001111111110111011000100001111110011111101101110011111011000111111011001101101101110100010110101001111111110011111010010001111110011111111101110110001000011111100111111011011100111101101011110 8fd9b6e8b53fe7d23f3feec43f3f6e7d8fd9b6e8b53fe7d23f3feec43f3f6e7b5e
UTF-8 蔣莎렍舍렡렋酊댄쨍n}蔣莎렍舍렡렋酊댄쨍n{^ 1110100010010100101000111110100010001110100011101110101110100000100011011110100010001000100011011110101110100000101000011110101110100000100010111110100110000101100010101110101110001100100001001110110010101000100011010110111001111101111010001001010010100011111010001000111010001110111010111010000010001101111010001000100010001101111010111010000010100001111010111010000010001011111010011000010110001010111010111000110010000100111011001010100010001101011011100111101101011110 e894a3e88e8eeba08de8888deba0a1eba08be9858aeb8c84eca88d6e7de894a3e88e8eeba08de8888deba0a1eba08be9858aeb8c84eca88d6e7b5e
UHC 蔣莎렍舍렡렋酊댄쨍n}蔣莎렍舍렡렋酊댄쨍n{^ 1110110111111000110111101110110110001110101000111101111011101100100011101011001010001110101000101110111111111000101101001110110111000010101110000110111001111101111011011111100011011110111011011000111010100011110111101110110010001110101100101000111010100010111011111111100010110100111011011100001010111000011011100111101101011110 edf8deed8ea3deec8eb28ea2eff8b4edc2b86e7dedf8deed8ea3deec8eb28ea2eff8b4edc2b86e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)