To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????@?????????@B 001111110011111100111111001111110011111100111111001111110011111100111111010000000011111100111111001111110011111100111111001111110011111100111111001111110100000001000010 3f3f3f3f3f3f3f3f3f403f3f3f3f3f3f3f3f3f4042
SJIS-WIN 艾??苡?????@艾??苡?????@B 11100100100010000011111100111111111001001000111100111111001111110011111100111111001111110100000011100100100010000011111100111111111001001000111100111111001111110011111100111111001111110100000001000010 e4883f3fe48f3f3f3f3f3f40e4883f3fe48f3f3f3f3f3f4042
EUC-JP 艾??苡?????@艾??苡?????@B 11100111111010000011111100111111111001111110111100111111001111110011111100111111001111110100000011100111111010000011111100111111111001111110111100111111001111110011111100111111001111110100000001000010 e7e83f3fe7ef3f3f3f3f3f40e7e83f3fe7ef3f3f3f3f3f4042
UTF-8 艾싲강苡득릸琉밸엑@艾싲강苡득릸琉밸엑@B 111010001000100110111110111011001000101110110010111010101011000010010101111010001000101110100001111010111001001110011101111010111010011010111000111011111010011110001100111010111011000010111000111011001001011110010001010000001110100010001001101111101110110010001011101100101110101010110000100101011110100010001011101000011110101110010011100111011110101110100110101110001110111110100111100011001110101110110000101110001110110010010111100100010100000001000010 e889beec8bb2eab095e88ba1eb939deba6b8efa78cebb0b8ec979140e889beec8bb2eab095e88ba1eb939deba6b8efa78cebb0b8ec97914042
UHC 艾싲강苡득릸琉밸엑@艾싲강苡득릸琉밸엑@B 111001001111010110011010111010111011000010101101111011001011111010110101111001101001000010010110111010111010010010111001111010111011111110100010010000001110010011110101100110101110101110110000101011011110110010111110101101011110011010010000100101101110101110100100101110011110101110111111101000100100000001000010 e4f59aebb0adecbeb5e69096eba4b9ebbfa240e4f59aebb0adecbeb5e69096eba4b9ebbfa24042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)