To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弔???????趙麥?拒弔???????趙麥?居^ 100100101010001000111111001111110011111100111111001111110011111100111111111001101110001011101010011011010011111110001011100100011001001010100010001111110011111100111111001111110011111100111111001111111110011011100010111010100110110100111111100010111000111101011110 92a23f3f3f3f3f3f3fe6e2ea6d3f8b9192a23f3f3f3f3f3f3fe6e2ea6d3f8b8f5e
EUC-JP 弔???????趙麥?拒弔???????趙麥?居^ 110001001010010000111111001111110011111100111111001111110011111100111111111011001110010011110011110011100011111110110101111100011100010010100100001111110011111100111111001111110011111100111111001111111110110011100100111100111100111000111111101101011110111101011110 c4a43f3f3f3f3f3f3fece4f3ce3fb5f1c4a43f3f3f3f3f3f3fece4f3ce3fb5ef5e
UTF-8 弔렲罹렗柳얘렕렟趙麥렋拒弔렲罹렗柳얘렕렟趙麥렋居^ 11100101101111001001010011101011101000001011001011101111101001111010011011101011101000001001011111101111101001111000100111101100100101101001100011101011101000001001010111101011101000001001111111101000101101101001100111101001101110101010010111101011101000001000101111100110100010111001001011100101101111001001010011101011101000001011001011101111101001111010011011101011101000001001011111101111101001111000100111101100100101101001100011101011101000001001010111101011101000001001111111101000101101101001100111101001101110101010010111101011101000001000101111100101101100011000010101011110 e5bc94eba0b2efa7a6eba097efa789ec9698eba095eba09fe8b699e9baa5eba08be68b92e5bc94eba0b2efa7a6eba097efa789ec9698eba095eba09fe8b699e9baa5eba08be5b1855e
UHC 弔렲罹렗柳얘렕렟趙麥렋拒弔렲罹렗柳얘렕렟趙麥렋居^ 11110000110000001000111010111111111011001011101010001110101011001110101011110111101111101110101010001110101010101000111010110000111100001110000111011000111010101000111010100010110010111101111011110000110000001000111010111111111011001011101010001110101011001110101011110111101111101110101010001110101010101000111010110000111100001110000111011000111010101000111010100010110010111101110001011110 f0c08ebfecba8eaceaf7beea8eaa8eb0f0e1d8ea8ea2cbdef0c08ebfecba8eaceaf7beea8eaa8eb0f0e1d8ea8ea2cbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)