To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 雲?d?懇?《癌臣?究雲?d?懇?《癌臣?究^ 10001001010111110011111110000010100001000011111110001101101001110011111110000001011100111000101011100000100100000110001000111111100010111000011010001001010111110011111110000010100001000011111110001101101001110011111110000001011100111000101011100000100100000110001000111111100010111000011001011110 895f3f82843f8da73f81738ae090623f8b86895f3f82843f8da73f81738ae090623f8b865e
EUC-JP 雲?d?懇?《癌臣?究雲?d?懇?《癌臣?究^ 10110001110000000011111110100011111001000011111110111010101010010011111110100001110101001011010011100010101111111100001100111111101101011110011010110001110000000011111110100011111001000011111110111010101010010011111110100001110101001011010011100010101111111100001100111111101101011110011001011110 b1c03fa3e43fbaa93fa1d4b4e2bfc33fb5e6b1c03fa3e43fbaa93fa1d4b4e2bfc33fb5e65e
UTF-8 雲뜹d뤋懇咽《癌臣뽈究雲뜹d뤋懇咽《癌臣뽈究^ 11101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101111100110100001111000011111101111101001101001111011100011100000001000101011100111100110011000110011101000100001111010001111101011101111011000100011100111101010011011011011101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101111100110100001111000011111101111101001101001111011100011100000001000101011100111100110011000110011101000100001111010001111101011101111011000100011100111101010011011011001011110 e99bb2eb9cb9efbd84eba48be68787efa69ee3808ae7998ce887a3ebbd88e7a9b6e99bb2eb9cb9efbd84eba48be68787efa69ee3808ae7998ce887a3ebbd88e7a9b65e
UHC 雲뜹d뤋懇咽《癌臣뽈究雲뜹d뤋懇咽《癌臣뽈究^ 111010101010001110110110111001011010001111100100100011111011101111001010110100001110011011101100101000011011011011100100110111111110001111101101101110111100101011001111101111001110101010100011101101101110010110100011111001001000111110111011110010101101000011100110111011001010000110110110111001001101111111100011111011011011101111001010110011111011110001011110 eaa3b6e5a3e48fbbcad0e6eca1b6e4dfe3edbbcacfbceaa3b6e5a3e48fbbcad0e6eca1b6e4dfe3edbbcacfbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)