To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 藥?~??????掖??鸚ο????熬ュ?^ 1110010101011010001111111000000101100000001111110011111100111111001111110011111100111111100111010111010000111111001111111110101001011111100000111100110100111111001111110011111100111111111000001001001010000011100001010011111101011110 e55a3f81603f3f3f3f3f3f9d743f3fea5f83cd3f3f3f3fe09283853f5e
EUC-JP 藥?〜?????˙掖??鸚ο????熬ュ?^ 11101001101110110011111110100001110000010011111100111111001111110011111100111111100011111010001010110010110110011101010100111111001111111111001111000000101001101100111100111111001111110011111100111111110111111111001010100101111001010011111101011110 e9bb3fa1c13f3f3f3f3f8fa2b2d9d53f3ff3c0a6cf3f3f3f3fdff2a5e53f5e
UTF-8 藥먩~溫롨변溫롨˙掖쒎톾鸚ο쉔蓼놅쉔熬ュ쐺^ 1110100010010111101001011110101110101000101010011110111110111101100111101110011010111010101010111110101110100001101010001110101110110011100000001110011010111010101010111110101110100001101010001100101110011001111001101000111010010110111011001001001010001110111011011000011010111110111010011011100010011010110011101011111111101100100010011001010011101111101001111000001011101011100001101000010111101100100010011001010011100111100001101010110011100011100000111010010111101100100100001011101001011110 e897a5eba8a9efbd9ee6baabeba1a8ebb380e6baabeba1a8cb99e68e96ec928eed86bee9b89acebfec8994efa782eb8685ec8994e786ace383a5ec90ba5e
UHC 藥먩~溫롨변溫롨˙掖쒎톾鸚ο쉔蓼놅쉔熬ュ쐺^ 11100101101101111001000011100110101000101010011011101000101011101000111011101000101110101010111111101000101011101000111011101000101000101010101111100100111110101001110011100101101101111001000011100101101001001010010111101111101111011010100011101001101001111000011011101111101111011010100011101000101000101010101111100101100111001001110001011110 e5b790e6a2a6e8ae8ee8baafe8ae8ee8a2abe4fa9ce5b790e5a4a5efbda8e9a786efbda8e8a2abe59c9c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)