To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN セ蜴ソ湿而セヒ治シ狎蜴ソ湿而セヒ治ジ^ 101111101110010110001110101111111000111010111100100011101010011110111110110010111111000111101110100011101010000110111100111000001011111011100101100011101011111110001110101111001000111010100111101111101100101111110001111011101000111010100001101111001101111001011110 bee58ebf8ebc8ea7becbf1ee8ea1bce0bee58ebf8ebc8ea7becbf1ee8ea1bcde5e
EUC-JP セ蜴ソ湿而セヒ?治シ狎蜴ソ湿而セヒ?治ジ^ 1000111010111110111010011110111010001110101111111011110010111110101111001010100110001110101111101000111011001011001111111011110010100011100011101011110011100000110000001110100111101110100011101011111110111100101111101011110010101001100011101011111010001110110010110011111110111100101000111000111010111100100011101101111001011110 8ebee9ee8ebfbcbebca98ebe8ecb3fbca38ebce0c0e9ee8ebfbcbebca98ebe8ecb3fbca38ebc8ede5e
UTF-8 セ蜴ソ湿而セヒ治シ狎蜴ソ湿而セヒ治ジ^ 11101111101111011011111011101000100111001011010011101111101111011011111111100110101110011011111111101000100000001000110011101111101111011011111011101111101111101000101111101110100001011010100111100110101100101011101111101111101111011011110011100111100010111000111011101000100111001011010011101111101111011011111111100110101110011011111111101000100000001000110011101111101111011011111011101111101111101000101111101110100001011010100111100110101100101011101111101111101111011011110011101111101111101001111001011110 efbdbee89cb4efbdbfe6b9bfe8808cefbdbeefbe8bee85a9e6b2bbefbdbce78b8ee89cb4efbdbfe6b9bfe8808cefbdbeefbe8bee85a9e6b2bbefbdbcefbe9e5e
UHC ????而???治?狎???而???治??^ 001111110011111100111111001111111110110010111011001111110011111100111111111101101011110100111111111001001110010000111111001111110011111111101100101110110011111100111111001111111111011010111101001111110011111101011110 3f3f3f3fecbb3f3f3ff6bd3fe4e43f3f3fecbb3f3f3ff6bd3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)