What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????WD????????WD^ 001111110011111100111111001111110011111100111111001111110011111101010111010001000011111100111111001111110011111100111111001111110011111100111111010101110100010001011110 3f3f3f3f3f3f3f3f57443f3f3f3f3f3f3f3f57445e
SJIS-WIN □?淋???桀?WD□?淋???桀?WD^ 100000011010000000111111100101111101001000111111001111110011111110011110011110110011111101010111010001001000000110100000001111111001011111010010001111110011111100111111100111100111101100111111010101110100010001011110 81a03f97d23f3f3f9e7b3f574481a03f97d23f3f3f9e7b3f57445e
EUC-JP □?淋??đ桀?WD□?淋??đ桀?WD^ 10100010101000100011111111001110110101000011111100111111100011111010100111000010110110111101110000111111010101110100010010100010101000100011111111001110110101000011111100111111100011111010100111000010110110111101110000111111010101110100010001011110 a2a23fced43f3f8fa9c2dbdc3f5744a2a23fced43f3f8fa9c2dbdc3f57445e
UTF-8 □룶淋헌룵đ桀택WD□룶淋헌룵đ桀택WD^ 111000101001011010100001111010111010001110110110111001101011011110001011111011011001011110001100111010111010001110110101110001001001000111100110101000011000000011101101100000111001110101010111010001001110001010010110101000011110101110100011101101101110011010110111100010111110110110010111100011001110101110100011101101011100010010010001111001101010000110000000111011011000001110011101010101110100010001011110 e296a1eba3b6e6b78bed978ceba3b5c491e6a180ed839d5744e296a1eba3b6e6b78bed978ceba3b5c491e6a180ed839d57445e
UHC □룶淋헌룵đ桀택WD□룶淋헌룵đ桀택WD^ 10100001111000001000111110101011110101111111101011000111111001011000111110101010101010011010001011001011111110101100010111000011010101110100010010100001111000001000111110101011110101111111101011000111111001011000111110101010101010011010001011001011111110101100010111000011010101110100010001011110 a1e08fabd7fac7e58faaa9a2cbfac5c35744a1e08fabd7fac7e58faaa9a2cbfac5c357445e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
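
The table above can be approximated offline with a short Python sketch. It is not the code behind this page. The codec names are assumptions about the closest Python equivalents (cp932 for SJIS-WIN, cp949 for UHC), and the helper name encode_table is made up for illustration. Unencodable characters are replaced with "?" (0x3f), as in the ISO-8859-1 row.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, round-tripped text, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("□WD^"):
        print(*row)
```

Individual rows may differ from the table for characters where the page's converter and Python's codecs use different mapping tables.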