To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蛯イ繧駆アソ謠耶ゥ夕蛯イ繧駆アソ謠耶ゥ夕^ 111001011000001010110010111000111000001010001011111011001011000110111111111001101000111110010110111010111010100110010111010110111110010110000010101100101110001110000010100010111110110010110001101111111110011010001111100101101110101110101001100101110101101101011110 e582b2e3828becb1bfe68f96eba9975be582b2e3828becb1bfe68f96eba9975b5e
EUC-JP 蛯イ繧駆アソ謠耶ゥ夕蛯イ繧駆アソ謠耶ゥ夕^ 1110100111100010100011101011001011100101111000101011011011101110100011101011000110001110101111111110101111101111110011001110110110001110101010011100110110111100111010011110001010001110101100101110010111100010101101101110111010001110101100011000111010111111111010111110111111001100111011011000111010101001110011011011110001011110 e9e28eb2e5e2b6ee8eb18ebfebefcced8ea9cdbce9e28eb2e5e2b6ee8eb18ebfebefcced8ea9cdbc5e
UTF-8 蛯イ繧駆アソ謠耶ゥ夕蛯イ繧駆アソ謠耶ゥ夕^ 11101000100110111010111111101111101111011011001011100111101110011010011111101001101001111000011011101111101111011011000111101111101111011011111111101000101011001010000011101000100000001011011011101111101111011010100111100101101001001001010111101000100110111010111111101111101111011011001011100111101110011010011111101001101001111000011011101111101111011011000111101111101111011011111111101000101011001010000011101000100000001011011011101111101111011010100111100101101001001001010101011110 e89bafefbdb2e7b9a7e9a786efbdb1efbdbfe8aca0e880b6efbda9e5a495e89bafefbdb2e7b9a7e9a786efbdb1efbdbfe8aca0e880b6efbda9e5a4955e
UHC ??????謠耶?夕??????謠耶?夕^ 001111110011111100111111001111110011111100111111111010011010101011100101101011010011111111100000101010100011111100111111001111110011111100111111001111111110100110101010111001011010110100111111111000001010101001011110 3f3f3f3f3f3fe9aae5ad3fe0aa3f3f3f3f3f3fe9aae5ad3fe0aa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)