To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 七社灼失蔀蒔実社n}七社灼失蔀蒔実社n{^ 10001110101101011000111011010000100011101101110010001110101110001000111011000001100011101010101010001110110000001000111011010000011011100111110110001110101101011000111011010000100011101101110010001110101110001000111011000001100011101010101010001110110000001000111011010000011011100111101101011110 8eb58ed08edc8eb88ec18eaa8ec08ed06e7d8eb58ed08edc8eb88ec18eaa8ec08ed06e7b5e
EUC-JP 七社灼失蔀蒔実社n}七社灼失蔀蒔実社n{^ 10111100101101111011110011010010101111001101111010111100101110101011110011000011101111001010110010111100110000101011110011010010011011100111110110111100101101111011110011010010101111001101111010111100101110101011110011000011101111001010110010111100110000101011110011010010011011100111101101011110 bcb7bcd2bcdebcbabcc3bcacbcc2bcd26e7dbcb7bcd2bcdebcbabcc3bcacbcc2bcd26e7b5e
UTF-8 七社灼失蔀蒔実社n}七社灼失蔀蒔実社n{^ 1110010010111000100000111110011110100100101111101110011110000001101111001110010110100100101100011110100010010100100000001110100010010010100101001110010110101110100111111110011110100100101111100110111001111101111001001011100010000011111001111010010010111110111001111000000110111100111001011010010010110001111010001001010010000000111010001001001010010100111001011010111010011111111001111010010010111110011011100111101101011110 e4b883e7a4bee781bce5a4b1e89480e89294e5ae9fe7a4be6e7de4b883e7a4bee781bce5a4b1e89480e89294e5ae9fe7a4be6e7b5e
UHC 七社灼失?蒔?社n}七社灼失?蒔?社n{^ 111101101101001011011110111001001110110111000111111000111111011100111111111000111100100000111111110111101110010001101110011111011111011011010010110111101110010011101101110001111110001111110111001111111110001111001000001111111101111011100100011011100111101101011110 f6d2dee4edc7e3f73fe3c83fdee46e7df6d2dee4edc7e3f73fe3c83fdee46e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)