To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蠍エ隶悟キ仙符蝗纂蠍エ隶悟キ仙符蝗纂^ 111001011011011010110100111010001010111010001100111001011011011110010000111001011001010110000100111001011001101110001110010110111110010110110110101101001110100010101110100011001110010110110111100100001110010110010101100001001110010110011011100011100101101101011110 e5b6b4e8ae8ce5b790e59584e59b8e5be5b6b4e8ae8ce5b790e59584e59b8e5b5e
EUC-JP 蠍エ隶悟キ仙符蝗纂蠍エ隶悟キ仙符蝗纂^ 11101010101110001000111010110100111100001011000010111000111001111000111010110111110000001110011111001001111001001110100111111011101110111011110011101010101110001000111010110100111100001011000010111000111001111000111010110111110000001110011111001001111001001110100111111011101110111011110001011110 eab88eb4f0b0b8e78eb7c0e7c9e4e9fbbbbceab88eb4f0b0b8e78eb7c0e7c9e4e9fbbbbc5e
UTF-8 蠍エ隶悟キ仙符蝗纂蠍エ隶悟キ仙符蝗纂^ 11101000101000001000110111101111101111011011010011101001100110101011011011100110100000101001111111101111101111011011011111100100101110111001100111100111101011001010011011101000100111011001011111100111101110101000001011101000101000001000110111101111101111011011010011101001100110101011011011100110100000101001111111101111101111011011011111100100101110111001100111100111101011001010011011101000100111011001011111100111101110101000001001011110 e8a08defbdb4e99ab6e6829fefbdb7e4bb99e7aca6e89d97e7ba82e8a08defbdb4e99ab6e6829fefbdb7e4bb99e7aca6e89d97e7ba825e
UHC ???悟?仙符蝗纂???悟?仙符蝗纂^ 0011111100111111001111111110011111110110001111111110000010111001110111011010110011111100110110011111001111000011001111110011111100111111111001111111011000111111111000001011100111011101101011001111110011011001111100111100001101011110 3f3f3fe7f63fe0b9ddacfcd9f3c33f3f3fe7f63fe0b9ddacfcd9f3c35e
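The conversion shown in the table can be approximated with a short Python sketch. It is an illustration only, not the code behind this page. The helper name `encode_bits` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's codecs may differ from this tool's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    sample = "^"  # any short string can be used here
    for label, codec in CODECS.items():
        bits, hexa = encode_bits(sample, codec)
        print(label, sample, bits, hexa)
```

For the ASCII character `^`, every charset listed here yields the same single byte (0x5e), which is why each row of the table ends in `5e`.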

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).