To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????w}????????w{^ 001111110011111100111111001111110011111100111111001111110011111101110111011111010011111100111111001111110011111100111111001111110011111100111111011101110111101101011110 3f3f3f3f3f3f3f3f777d3f3f3f3f3f3f3f3f777b5e
SJIS-WIN □?淋???桀?w}□?淋???桀?w{^ 100000011010000000111111100101111101001000111111001111110011111110011110011110110011111101110111011111011000000110100000001111111001011111010010001111110011111100111111100111100111101100111111011101110111101101011110 81a03f97d23f3f3f9e7b3f777d81a03f97d23f3f3f9e7b3f777b5e
EUC-JP □?淋??đ桀?w}□?淋??đ桀?w{^ 10100010101000100011111111001110110101000011111100111111100011111010100111000010110110111101110000111111011101110111110110100010101000100011111111001110110101000011111100111111100011111010100111000010110110111101110000111111011101110111101101011110 a2a23fced43f3f8fa9c2dbdc3f777da2a23fced43f3f8fa9c2dbdc3f777b5e
UTF-8 □룶淋헌룵đ桀택w}□룶淋헌룵đ桀택w{^ 111000101001011010100001111010111010001110110110111001101011011110001011111011011001011110001100111010111010001110110101110001001001000111100110101000011000000011101101100000111001110101110111011111011110001010010110101000011110101110100011101101101110011010110111100010111110110110010111100011001110101110100011101101011100010010010001111001101010000110000000111011011000001110011101011101110111101101011110 e296a1eba3b6e6b78bed978ceba3b5c491e6a180ed839d777de296a1eba3b6e6b78bed978ceba3b5c491e6a180ed839d777b5e
UHC □룶淋헌룵đ桀택w}□룶淋헌룵đ桀택w{^ 10100001111000001000111110101011110101111111101011000111111001011000111110101010101010011010001011001011111110101100010111000011011101110111110110100001111000001000111110101011110101111111101011000111111001011000111110101010101010011010001011001011111110101100010111000011011101110111101101011110 a1e08fabd7fac7e58faaa9a2cbfac5c3777da1e08fabd7fac7e58faaa9a2cbfac5c3777b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)