To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN シス鴫ヤ骼ウサ・シス鴫ヤ骼ウサ・^ 111100001111110010111100111100011000111010111101100011101011000011110000101011111101010011101001100011101011001110111011101001011111000011111100101111001111000110001110101111011000111010110000111100001010111111010100111010011000111010110011101110111010010101011110 f0fcbcf18ebd8eb0f0afd4e98eb3bba5f0fcbcf18ebd8eb0f0afd4e98eb3bba55e
EUC-JP ?シ?ス鴫?ヤ骼ウサ・?シ?ス鴫?ヤ骼ウサ・^ 001111111000111010111100001111111000111010111101101111001011001000111111100011101101010011110001111011101000111010110011100011101011101110001110101001010011111110001110101111000011111110001110101111011011110010110010001111111000111011010100111100011110111010001110101100111000111010111011100011101010010101011110 3f8ebc3f8ebdbcb23f8ed4f1ee8eb38ebb8ea53f8ebc3f8ebdbcb23f8ed4f1ee8eb38ebb8ea55e
UTF-8 シス鴫ヤ骼ウサ・シス鴫ヤ骼ウサ・^ 11101110100000101011101111101111101111011011110011101110100001001000100111101111101111011011110111101001101101001010101111101110100000011010111011101111101111101001010011101001101010101011110011101111101111011011001111101111101111011011101111101111101111011010010111101110100000101011101111101111101111011011110011101110100001001000100111101111101111011011110111101001101101001010101111101110100000011010111011101111101111101001010011101001101010101011110011101111101111011011001111101111101111011011101111101111101111011010010101011110 ee82bbefbdbcee8489efbdbde9b4abee81aeefbe94e9aabcefbdb3efbdbbefbda5ee82bbefbdbcee8489efbdbde9b4abee81aeefbe94e9aabcefbdb3efbdbbefbda55e
UHC ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)