To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 藥??鎰??酉??也ы?藥??鎰??酉??也ы?E 1110010101011010001111110011111111101000010011000011111100111111100100111101000100111111001111111001011011100111100001001000110100111111111001010101101000111111001111111110100001001100001111110011111110010011110100010011111100111111100101101110011110000100100011010011111101000101 e55a3f3fe84c3f3f93d13f3f96e7848d3fe55a3f3fe84c3f3f93d13f3f96e7848d3f45
EUC-JP 藥??鎰??酉??也ы?藥??鎰??酉??也ы?E 1110100110111011001111110011111111101111101011010011111100111111110001101101001100111111001111111100110011101001101001111110110100111111111010011011101100111111001111111110111110101101001111110011111111000110110100110011111100111111110011001110100110100111111011010011111101000101 e9bb3f3fefad3f3fc6d33f3fcce9a7ed3fe9bb3f3fefad3f3fc6d33f3fcce9a7ed3f45
UTF-8 藥띲끏鎰먪독酉곴덮也ы뎸藥띲끏鎰먪독酉곴덮也ы뎻E 1110100010010111101001011110101110011101101100101110101110000001100011111110100110001110101100001110101110101000101010101110101110001111100001011110100110000101100010011110101010110011101101001110101110001101101011101110010010111001100111111101000110001011111010111000111010111000111010001001011110100101111010111001110110110010111010111000000110001111111010011000111010110000111010111010100010101010111010111000111110000101111010011000010110001001111010101011001110110100111010111000110110101110111001001011100110011111110100011000101111101011100011101011101101000101 e897a5eb9db2eb818fe98eb0eba8aaeb8f85e98589eab3b4eb8daee4b99fd18beb8eb8e897a5eb9db2eb818fe98eb0eba8aaeb8f85e98589eab3b4eb8daee4b99fd18beb8ebb45
UHC 藥띲끏鎰먪독酉곴덮也ы뎸藥띲끏鎰먪독酉곴덮也ы뎻E 11100101101101111000110111100011100001011011111111101100111100001001000011100111101101011011011011101011101101111000000111101010101101011010010011100101101001011010110011101101100010011000101111100101101101111000110111100011100001011011111111101100111100001001000011100111101101011011011011101011101101111000000111101010101101011010010011100101101001011010110011101101100010011000111001000101 e5b78de385bfecf090e7b5b6ebb781eab5a4e5a5aced898be5b78de385bfecf090e7b5b6ebb781eab5a4e5a5aced898e45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)