Which bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 鞨ェ辷オ鞨ェ驍ェn}鞨ェ辷オ鞨ェ驍ェn{^ 1110100011100000101010101110011110001000101101011110100011100000101010101110100110000010101010100110111001111101111010001110000010101010111001111000100010110101111010001110000010101010111010011000001010101010011011100111101101011110 e8e0aae788b5e8e0aae982aa6e7de8e0aae788b5e8e0aae982aa6e7b5e
EUC-JP 鞨ェ辷オ鞨ェ驍ェn}鞨ェ辷オ鞨ェ驍ェn{^ 11110000111000101000111010101010111011011110100010001110101101011111000011100010100011101010101011110001111000101000111010101010011011100111110111110000111000101000111010101010111011011110100010001110101101011111000011100010100011101010101011110001111000101000111010101010011011100111101101011110 f0e28eaaede88eb5f0e28eaaf1e28eaa6e7df0e28eaaede88eb5f0e28eaaf1e28eaa6e7b5e
UTF-8 鞨ェ辷オ鞨ェ驍ェn}鞨ェ辷オ鞨ェ驍ェn{^ 1110100110011110101010001110111110111101101010101110100010111110101101111110111110111101101101011110100110011110101010001110111110111101101010101110100110101001100011011110111110111101101010100110111001111101111010011001111010101000111011111011110110101010111010001011111010110111111011111011110110110101111010011001111010101000111011111011110110101010111010011010100110001101111011111011110110101010011011100111101101011110 e99ea8efbdaae8beb7efbdb5e99ea8efbdaae9a98defbdaa6e7de99ea8efbdaae8beb7efbdb5e99ea8efbdaae9a98defbdaa6e7b5e
UHC 鞨???鞨?驍?n}鞨???鞨?驍?n{^ 110010101110101000111111001111110011111111001010111010100011111111111101101001000011111101101110011111011100101011101010001111110011111100111111110010101110101000111111111111011010010000111111011011100111101101011110 caea3f3f3fcaea3ffda43f6e7dcaea3f3f3fcaea3ffda43f6e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
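
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown. The function name `encode_row` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte is written as eight binary digits, most significant bit first.
    bits = "".join(format(byte, "08b") for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "n}"  # any short string can be used here
    for label, codec in CODECS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

For an ASCII-only input such as `n}`, every charset listed produces the same bytes (6e 7d). The rows differ only for characters outside ASCII.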