To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂??語⑤?蘂???ы??ル?茹??蘂??凹 11100101010000010011111100111111100011001110101010000111010001000011111111100101010000010011111100111111001111111000010010001101001111110011111110000011100010110011111111100100101001010011111100111111111001010100000100111111001111111000100110011010 e5413f3f8cea87443fe5413f3f3f848d3f3f838b3fe4a53f3fe5413f3f899a
EUC-JP 蘂??語??蘂???ы??ル?茹??蘂??凹 111010011010001000111111001111111011100011101100001111110011111111101001101000100011111100111111001111111010011111101101001111110011111110100101111010110011111111101000101001110011111100111111111010011010001000111111001111111011000111111010 e9a23f3fb8ec3f3fe9a23f3f3fa7ed3f3fa5eb3fe8a73f3fe9a23f3fb1fa
UTF-8 蘂뚮졁語⑤젇蘂뚮졁吳ы쓻溜ル졁茹됰젪蘂뚮졁凹 1110100010011000100000101110101110011010101011101110110010100001100000011110100010101010100111101110001010010001101001001110110010100000100001111110100010011000100000101110101110011010101011101110110010100001100000011110010110010000101100111101000110001011111011001001001110111011111011111010011110001011111000111000001110101011111011001010000110000001111010001000110010111001111010111001000010110000111011001010000010101010111010001001100010000010111010111001101010101110111011001010000110000001111001011000011110111001 e89882eb9aaeeca181e8aa9ee291a4eca087e89882eb9aaeeca181e590b3d18bec93bbefa78be383abeca181e88cb9eb90b0eca0aae89882eb9aaeeca181e587b9
UHC 蘂뚮졁語⑤젇蘂뚮졁吳ы쓻溜ル졁茹됰젪蘂뚮졁凹 1110011111011110100011001110101110100000101100101110010111011110101010001110101110100000100010101110011111011110100011001110101110100000101100101110011111101111101011001110110110011101100101101110101011111110101010111110101110100000101100101110011010101010100010011110101110100000101000101110011111011110100011001110101110100000101100101110100011101010 e7de8ceba0b2e5dea8eba08ae7de8ceba0b2e7efaced9d96eafeabeba0b2e6aa89eba0a2e7de8ceba0b2e8ea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)