To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ?莎?舍??酊??}?莎?舍??酊??{^ 001111111110010010110011001111111110010001110001001111110011111111100111110000100011111100111111011111010011111111100100101100110011111111100100011100010011111100111111111001111100001000111111001111110111101101011110 3fe4b33fe4713f3fe7c23f3f7d3fe4b33fe4713f3fe7c23f3f7b5e
EUC-JP 蔣莎?舍??酊??}蔣莎?舍??酊??{^ 10001111110110011011011011101000101101010011111111100111110100100011111100111111111011101100010000111111001111110111110110001111110110011011011011101000101101010011111111100111110100100011111100111111111011101100010000111111001111110111101101011110 8fd9b6e8b53fe7d23f3feec43f3f7d8fd9b6e8b53fe7d23f3feec43f3f7b5e
UTF-8 蔣莎렍舍렡렋酊댄쨌}蔣莎렍舍렡렋酊댄쨌{^ 111010001001010010100011111010001000111010001110111010111010000010001101111010001000100010001101111010111010000010100001111010111010000010001011111010011000010110001010111010111000110010000100111011001010100010001100011111011110100010010100101000111110100010001110100011101110101110100000100011011110100010001000100011011110101110100000101000011110101110100000100010111110100110000101100010101110101110001100100001001110110010101000100011000111101101011110 e894a3e88e8eeba08de8888deba0a1eba08be9858aeb8c84eca88c7de894a3e88e8eeba08de8888deba0a1eba08be9858aeb8c84eca88c7b5e
UHC 蔣莎렍舍렡렋酊댄쨌}蔣莎렍舍렡렋酊댄쨌{^ 111011011111100011011110111011011000111010100011110111101110110010001110101100101000111010100010111011111111100010110100111011011100001010110111011111011110110111111000110111101110110110001110101000111101111011101100100011101011001010001110101000101110111111111000101101001110110111000010101101110111101101011110 edf8deed8ea3deec8eb28ea2eff8b4edc2b77dedf8deed8ea3deec8eb28ea2eff8b4edc2b77b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)