To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 藥?ク墺?????藥?ク墺??餓??E 1110010101011010001111111000001101001110100110101101001000111111001111110011111100111111001111111110010101011010001111111000001101001110100110101101001000111111001111111000100111101100001111110011111101000101 e55a3f834e9ad23f3f3f3f3fe55a3f834e9ad23f3f89ec3f3f45
EUC-JP 藥?ク墺??孼??藥?ク墺??餓??E 11101001101110110011111110100101101011111101010011010100001111110011111110001111101110101100001100111111001111111110100110111011001111111010010110101111110101001101010000111111001111111011001011101110001111110011111101000101 e9bb3fa5afd4d43f3f8fbac33f3fe9bb3fa5afd4d43f3fb2ee3f3f45
UTF-8 藥썹ク墺듣꽦孼껅릫藥썹ク墺듣꽦餓뽬퀕E 11101000100101111010010111101100100011011011100111100011100000101010111111100101101000101011101011101011100100111010001111101010101111011010011011100101101011011011110011101010101110111000010111101011101001101010101111101000100101111010010111101100100011011011100111100011100000101010111111100101101000101011101011101011100100111010001111101010101111011010011011101001101001001001001111101011101111011010110011101101100000001001010101000101 e897a5ec8db9e382afe5a2baeb93a3eabda6e5adbceabb85eba6abe897a5ec8db9e382afe5a2baeb93a3eabda6e9a493ebbdaced809545
UHC 藥썹ク墺듣꽦孼껅릫藥썹ク墺듣꽦餓뽬퀕E 11100101101101111011110111100111101010111010111111100111111100101011010111101000100001001011000111100101111011011000001111100110100100001000110111100101101101111011110111100111101010111010111111100111111100101011010111101000100001001011000111100100101110111001011011101000101100111000101001000101 e5b7bde7abafe7f2b5e884b1e5ed83e6908de5b7bde7abafe7f2b5e884b1e4bb96e8b38a45

SJIS-WIN, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-WIN, i.e. CP932) and on UNIX (EUC-JP).
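
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_table` is hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("E"):
        print(label, bits, hex_string)
```

For the input `E`, every charset in this sketch yields the single byte `01000101` (hex `45`), which agrees with the final byte of each row in the table.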