To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????~ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101111110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7e
SJIS-WIN ?????ぜ毓??柔ル??щ????儒??~ 00111111001111110011111100111111001111111000001010111010100111110111100100111111001111111000111101011111100000111000101100111111001111111000010010001011001111110011111100111111001111111000111011110010001111110011111101111110 3f3f3f3f3f82ba9f793f3f8f5f838b3f3f848b3f3f3f3f8ef23f3f7e
EUC-JP ???沅?ぜ毓??柔ル??щ?彛??儒??~ 0011111100111111001111111000111111000110111010010011111110100100101111001101110111011010001111110011111110111101110000001010010111101011001111110011111110100111111010110011111110001111101111001111101000111111001111111011110011110100001111110011111101111110 3f3f3f8fc6e93fa4bcddda3f3fbdc0a5eb3f3fa7eb3f8fbcfa3f3fbcf43f3f7e
UTF-8 嶺뚮뿭沅좄ぜ毓쀧춯柔ル읇嶪щ쨪彛볞튃儒뱀젧~ 111011111010011010101011111010111001101010101110111010111011111110101101111001101011001010000101111011001010001010000100111000111000000110011100111001101010111110010011111011001000000010100111111011001011011010101111111001101001111110010100111000111000001110101011111011001001110110000111111001011011011010101010110100011000100111101100101010001010101011100101101111011001101111101011101100111001111011101101100010101000001111100101100001001001001011101011101100011000000011101100101000001010011101111110 efa6abeb9aaeebbfade6b285eca284e3819ce6af93ec80a7ecb6afe69f94e383abec9d87e5b6aad189eca8aae5bd9bebb39eed8a83e58492ebb180eca0a77e
UHC 嶺뚮뿭沅좄ぜ毓쀧춯柔ル읇嶪щ쨪彛볞튃儒뱀젧~ 11100111101011011000110011101011100101111010110111101010101101101010000011101000101010101011110011101011101111101001011111100111101011011000110011101010111101011010101111101011100111111011110111100101111101011010110011101011101001001000010011101100101011011001001111100100101110011001100111101010111000111011100111101100101000001001111101111110 e7ad8ceb97adeab6a0e8aabcebbe97e7ad8ceaf5abeb9fbde5f5aceba484ecad93e4b999eae3b9eca09f7e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)