Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 竪存袖脱則束奪属孫i竪存袖脱則束奪属孫iB 100100100100011110010001101101101001000110110011100100100100010110010001101001011001000110101001100100100100010010010001101011101001000110110111011010011001001001000111100100011011011010010001101100111001001001000101100100011010010110010001101010011001001001000100100100011010111010010001101101110110100101000010 924791b691b3924591a591a9924491ae91b769924791b691b3924591a591a9924491ae91b76942
EUC-JP 竪存袖脱則束奪属孫i竪存袖脱則束奪属孫iB 110000111010100011000010101110001100001010110101110000111010011011000010101001111100001010101011110000111010010111000010101100001100001010111001011010011100001110101000110000101011100011000010101101011100001110100110110000101010011111000010101010111100001110100101110000101011000011000010101110010110100101000010 c3a8c2b8c2b5c3a6c2a7c2abc3a5c2b0c2b969c3a8c2b8c2b5c3a6c2a7c2abc3a5c2b0c2b96942
UTF-8 竪存袖脱則束奪属孫i竪存袖脱則束奪属孫iB 111001111010101110101010111001011010110110011000111010001010001010010110111010001000010010110001111001011000100110000111111001101001110110011111111001011010010110101010111001011011000110011110111001011010110110101011011010011110011110101011101010101110010110101101100110001110100010100010100101101110100010000100101100011110010110001001100001111110011010011101100111111110010110100101101010101110010110110001100111101110010110101101101010110110100101000010 e7abaae5ad98e8a296e884b1e58987e69d9fe5a5aae5b19ee5adab69e7abaae5ad98e8a296e884b1e58987e69d9fe5a5aae5b19ee5adab6942
UHC 竪存袖?則束奪?孫i竪存袖?則束奪?孫iB 1110001010110101111100001110110111100010110000000011111111110110110011101110000111010110111101111010110000111111111000011101110101101001111000101011010111110000111011011110001011000000001111111111011011001110111000011101011011110111101011000011111111100001110111010110100101000010 e2b5f0ede2c03ff6cee1d6f7ac3fe1dd69e2b5f0ede2c03ff6cee1d6f7ac3fe1dd6942
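
The rows above can be reproduced roughly with a short script. This is a minimal sketch, not the converter's actual implementation: the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charsets listed, the helper name `encode_row` is hypothetical, and `errors="replace"` is assumed to mirror the way unmappable characters appear as `?` (0x3f) in the table.

```python
# Table labels mapped to Python codec names (assumed equivalents, not
# necessarily the codecs the original converter uses).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`.

    Characters the codec cannot represent are replaced with '?',
    which shows up as the byte 0x3f.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "竪i"  # one kanji followed by one ASCII letter
    for label, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

ASCII characters such as `i` and `B` encode to the same single byte in every charset listed, which is why each row in the table ends in `6942`.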

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.