To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN セュ自セャ紗セュ鉐爾詔ニセュ自セャ紗セュ鉐爾証ム^ 1011111010101101100011101010100110111110101011001000111011010001101111101010110111100111111011111000111010100010100011111101100111000110101111101010110110001110101010011011111010101100100011101101000110111110101011011110011111101111100011101010001010001111110110001101000101011110 bead8ea9beac8ed1beade7ef8ea28fd9c6bead8ea9beac8ed1beade7ef8ea28fd8d15e
EUC-JP セュ自セャ紗セュ鉐爾詔ニセュ自セャ紗セュ鉐爾証ム^ 10001110101111101000111010101101101111001010101110001110101111101000111010101100101111001101001110001110101111101000111010101101111011101111000110111100101001001011111011011011100011101100011010001110101111101000111010101101101111001010101110001110101111101000111010101100101111001101001110001110101111101000111010101101111011101111000110111100101001001011111011011010100011101101000101011110 8ebe8eadbcab8ebe8eacbcd38ebe8eadeef1bca4bedb8ec68ebe8eadbcab8ebe8eacbcd38ebe8eadeef1bca4beda8ed15e
UTF-8 セュ自セャ紗セュ鉐爾詔ニセュ自セャ紗セュ鉐爾証ム^ 11101111101111011011111011101111101111011010110111101000100001111010101011101111101111011011111011101111101111011010110011100111101101001001011111101111101111011011111011101111101111011010110111101001100010011001000011100111100010001011111011101000101010011001010011101111101111101000011011101111101111011011111011101111101111011010110111101000100001111010101011101111101111011011111011101111101111011010110011100111101101001001011111101111101111011011111011101111101111011010110111101001100010011001000011100111100010001011111011101000101010001011110011101111101111101001000101011110 efbdbeefbdade887aaefbdbeefbdace7b497efbdbeefbdade98990e788bee8a994efbe86efbdbeefbdade887aaefbdbeefbdace7b497efbdbeefbdade98990e788bee8a8bcefbe915e
UHC ??自??紗???爾詔???自??紗???爾??^ 0011111100111111111011011011101100111111001111111101111011101001001111110011111100111111111011001011001111110000110111110011111100111111001111111110110110111011001111110011111111011110111010010011111100111111001111111110110010110011001111110011111101011110 3f3fedbb3f3fdee93f3f3fecb3f0df3f3f3fedbb3f3fdee93f3f3fecb33f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)