What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 藥?ク墺?ⅹ癌??藥?ク墺?ⅹ癌??B 1110010101011010001111111000001101001110100110101101001000111111111110100100100110001010111000000011111100111111111001010101101000111111100000110100111010011010110100100011111111111010010010011000101011100000001111110011111101000010 e55a3f834e9ad23ffa498ae03f3fe55a3f834e9ad23ffa498ae03f3f42
EUC-JP 藥?ク墺??癌??藥?ク墺??癌??B 111010011011101100111111101001011010111111010100110101000011111100111111101101001110001000111111001111111110100110111011001111111010010110101111110101001101010000111111001111111011010011100010001111110011111101000010 e9bb3fa5afd4d43f3fb4e23f3fe9bb3fa5afd4d43f3fb4e23f3f42
UTF-8 藥썹ク墺드ⅹ癌닸릍藥썹ク墺드ⅹ癌닸릍B 11101000100101111010010111101100100011011011100111100011100000101010111111100101101000101011101011101011100100111001110011100010100001011011100111100111100110011000110011101011100010111011100011101011101001101000110111101000100101111010010111101100100011011011100111100011100000101010111111100101101000101011101011101011100100111001110011100010100001011011100111100111100110011000110011101011100010111011100011101011101001101000110101000010 e897a5ec8db9e382afe5a2baeb939ce285b9e7998ceb8bb8eba68de897a5ec8db9e382afe5a2baeb939ce285b9e7998ceb8bb8eba68d42
UHC 藥썹ク墺드ⅹ癌닸릍藥썹ク墺드ⅹ癌닸릍B 11100101101101111011110111100111101010111010111111100111111100101011010111100101101001011010101011100100110111111011010011100110101110001010110011100101101101111011110111100111101010111010111111100111111100101011010111100101101001011010101011100100110111111011010011100110101110001010110001000010 e5b7bde7abafe7f2b5e5a5aae4dfb4e6b8ace5b7bde7abafe7f2b5e5a5aae4dfb4e6b8ac42

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
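
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. It assumes that the page's charset labels correspond to Python's codec names as listed in the hypothetical CHARSETS mapping (in particular SJIS-Win as cp932 and UHC as cp949), and that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table above. The function name encode_table is illustrative only.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("あB"):
        print(label, binary, hexadecimal)
```

For the input "あB", the ISO-8859-1 row comes out as 3f42: the hiragana letter has no ISO-8859-1 representation and becomes "?" (0x3f), while "B" is 0x42 in every charset listed here.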