To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???Zf???Z^}Y???Zf???Z^}bE 00111111001111110011111101011010011001100011111100111111001111110101101001011110011111010101100100111111001111110011111101011010011001100011111100111111001111110101101001011110011111010110001001000101 3f3f3f5a663f3f3f5a5e7d593f3f3f5a663f3f3f5a5e7d6245
SJIS-WIN 藥??Zf藥??Z^}Y藥??Zf藥??Z^}bE 1110010101011010001111110011111101011010011001101110010101011010001111110011111101011010010111100111110101011001111001010101101000111111001111110101101001100110111001010101101000111111001111110101101001011110011111010110001001000101 e55a3f3f5a66e55a3f3f5a5e7d59e55a3f3f5a66e55a3f3f5a5e7d6245
EUC-JP 藥??Zf藥??Z^}Y藥??Zf藥??Z^}bE 1110100110111011001111110011111101011010011001101110100110111011001111110011111101011010010111100111110101011001111010011011101100111111001111110101101001100110111010011011101100111111001111110101101001011110011111010110001001000101 e9bb3f3f5a66e9bb3f3f5a5e7d59e9bb3f3f5a66e9bb3f3f5a5e7d6245
UTF-8 藥먲쉠Zf藥먲쉠Z^}Y藥먲쉠Zf藥먲쉠Z^}bE 11101000100101111010010111101011101010001011001011101100100010011010000001011010011001101110100010010111101001011110101110101000101100101110110010001001101000000101101001011110011111010101100111101000100101111010010111101011101010001011001011101100100010011010000001011010011001101110100010010111101001011110101110101000101100101110110010001001101000000101101001011110011111010110001001000101 e897a5eba8b2ec89a05a66e897a5eba8b2ec89a05a5e7d59e897a5eba8b2ec89a05a66e897a5eba8b2ec89a05a5e7d6245
UHC 藥먲쉠Zf藥먲쉠Z^}Y藥먲쉠Zf藥먲쉠Z^}bE 11100101101101111001000011101111101111011010101001011010011001101110010110110111100100001110111110111101101010100101101001011110011111010101100111100101101101111001000011101111101111011010101001011010011001101110010110110111100100001110111110111101101010100101101001011110011111010110001001000101 e5b790efbdaa5a66e5b790efbdaa5a5e7d59e5b790efbdaa5a66e5b790efbdaa5a5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)