To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????W}???????????W{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 淨???疑???除?∧W}淨???疑???除?∧W{^ 1001111111000100001111110011111100111111100010110101111000111111001111110011111110001111100111000011111110000001110010000101011101111101100111111100010000111111001111110011111110001011010111100011111100111111001111111000111110011100001111111000000111001000010101110111101101011110 9fc43f3f3f8b5e3f3f3f8f9c3f81c8577d9fc43f3f3f8b5e3f3f3f8f9c3f81c8577b5e
EUC-JP 淨???疑???除?∧W}淨???疑???除?∧W{^ 1101111011000110001111110011111100111111101101011011111100111111001111110011111110111101111111000011111110100010110010100101011101111101110111101100011000111111001111110011111110110101101111110011111100111111001111111011110111111100001111111010001011001010010101110111101101011110 dec63f3f3fb5bf3f3f3fbdfc3fa2ca577ddec63f3f3fb5bf3f3f3fbdfc3fa2ca577b5e
UTF-8 淨렠罹렗疑양렏렕除곕∧W}淨렠罹렗疑양렏렕除곕∧W{^ 1110011010110111101010001110101110100000101000001110111110100111101001101110101110100000100101111110011110010110100100011110110010010110100100011110101110100000100011111110101110100000100101011110100110011001101001001110101010110011100101011110001010001000101001110101011101111101111001101011011110101000111010111010000010100000111011111010011110100110111010111010000010010111111001111001011010010001111011001001011010010001111010111010000010001111111010111010000010010101111010011001100110100100111010101011001110010101111000101000100010100111010101110111101101011110 e6b7a8eba0a0efa7a6eba097e79691ec9691eba08feba095e999a4eab395e288a7577de6b7a8eba0a0efa7a6eba097e79691ec9691eba08feba095e999a4eab395e288a7577b5e
UHC 淨렠罹렗疑양렏렕除곕∧W}淨렠罹렗疑양렏렕除곕∧W{^ 11101111111001001000111010110001111011001011101010001110101011001110101111110111101111101110011110001110101001011000111010101010111100001011011010110000111010111010000111111100010101110111110111101111111001001000111010110001111011001011101010001110101011001110101111110111101111101110011110001110101001011000111010101010111100001011011010110000111010111010000111111100010101110111101101011110 efe48eb1ecba8eacebf7bee78ea58eaaf0b6b0eba1fc577defe48eb1ecba8eacebf7bee78ea58eaaf0b6b0eba1fc577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)