To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 綜???趙???蹄?建綜???趙???蹄?建^ 10010001100011100011111100111111001111111110011011100010001111110011111100111111100100101111101100111111100011001001101010010001100011100011111100111111001111111110011011100010001111110011111100111111100100101111101100111111100011001001101001011110 918e3f3f3fe6e23f3f3f92fb3f8c9a918e3f3f3fe6e23f3f3f92fb3f8c9a5e
EUC-JP 綜???趙???蹄?建綜???趙???蹄?建^ 11000001111011100011111100111111001111111110110011100100001111110011111100111111110001001111110100111111101101111111101011000001111011100011111100111111001111111110110011100100001111110011111100111111110001001111110100111111101101111111101001011110 c1ee3f3f3fece43f3f3fc4fd3fb7fac1ee3f3f3fece43f3f3fc4fd3fb7fa5e
UTF-8 綜찔렰렞趙뀜렰렯蹄븍建綜찔렰렞趙뀜렰렯蹄븍建^ 11100111101101101001110011101100101100001001010011101011101000001011000011101011101000001001111011101000101101101001100111101011100000001001110011101011101000001011000011101011101000001010111111101000101110011000010011101011101110001000110111100101101110111011101011100111101101101001110011101100101100001001010011101011101000001011000011101011101000001001111011101000101101101001100111101011100000001001110011101011101000001011000011101011101000001010111111101000101110011000010011101011101110001000110111100101101110111011101001011110 e7b69cecb094eba0b0eba09ee8b699eb809ceba0b0eba0afe8b984ebb88de5bbbae7b69cecb094eba0b0eba09ee8b699eb809ceba0b0eba0afe8b984ebb88de5bbba5e
UHC 綜찔렰렞趙뀜렰렯蹄븍建綜찔렰렞趙뀜렰렯蹄븍建^ 111100001111110011000010111100011000111010111101100011101010111111110000111000011011001011110001100011101011110110001110101111001111000010110100101110101110101111001011111011111111000011111100110000101111000110001110101111011000111010101111111100001110000110110010111100011000111010111101100011101011110011110000101101001011101011101011110010111110111101011110 f0fcc2f18ebd8eaff0e1b2f18ebd8ebcf0b4baebcbeff0fcc2f18ebd8eaff0e1b2f18ebd8ebcf0b4baebcbef5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
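
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the charsets listed above (for example `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_bits` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is assumed to mirror the `3f` bytes visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "建^"
    for label, codec in CHARSETS.items():
        bits, hexa = encode_bits(sample, codec)
        print(label, sample, bits, hexa)
```

For instance, `encode_bits("^", "utf-8")` returns `("01011110", "5e")`, which matches the final byte of every row in the table.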