To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????ぜ淫??猥??悠??銀レ?沃??? 0011111100111111001111110011111100111111100000101011101010001000111110100011111100111111111000001100111000111111001111111001011101001001001111110011111110001011111000101000001110001100001111111001011110000000001111110011111100111111 3f3f3f3f3f82ba88fa3f3fe0ce3f3f97493f3f8be2838c3f97803f3f3f
EUC-JP ???沅?ぜ淫??猥?Ŧ悠??銀レ?沃??彛 0011111100111111001111111000111111000110111010010011111110100100101111001011000011111100001111110011111111100000110100000011111110001111101010011010111111001101101010100011111100111111101101101110010010100101111011000011111111001101111000000011111100111111100011111011110011111010 3f3f3f8fc6e93fa4bcb0fc3f3fe0d03f8fa9afcdaa3f3fb6e4a5ec3fcde03f3f8fbcfa
UTF-8 嶺뚮뿭沅좄ぜ淫볦춷猥됰Ŧ悠뺟뙠銀レ돱沃쇱룊彛 1110111110100110101010111110101110011010101011101110101110111111101011011110011010110010100001011110110010100010100001001110001110000001100111001110011010110111101010111110101110110011101001101110110010110110101101111110011110001100101001011110101110010000101100001100010110100110111001101000001010100000111010111011101010011111111010111001100110100000111010011000101010000000111000111000001110101100111010111000111110110001111001101011001010000011111011001000011110110001111010111010001110001010111001011011110110011011 efa6abeb9aaeebbfade6b285eca284e3819ce6b7abebb3a6ecb6b7e78ca5eb90b0c5a6e682a0ebba9feb99a0e98a80e383aceb8fb1e6b283ec87b1eba38ae5bd9b
UHC 嶺뚮뿭沅좄ぜ淫볦춷猥됰Ŧ悠뺟뙠銀レ돱沃쇱룊彛 1110011110101101100011001110101110010111101011011110101010110110101000001110100010101010101111001110101111100010100100111110110010101101100100111110100011100101100010011110101110101000101011101110101011101101100101011110011110001100101001011110101111011110101010111110110010001001101101001110100010101010101111001110110010001111100010011110110010101101 e7ad8ceb97adeab6a0e8aabcebe293ecad93e8e589eba8aeeaed95e78ca5ebdeabec89b4e8aabcec8f89ecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)