To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????M}????????M{^ 001111110011111100111111001111110011111100111111001111110011111101001101011111010011111100111111001111110011111100111111001111110011111100111111010011010111101101011110 3f3f3f3f3f3f3f3f4d7d3f3f3f3f3f3f3f3f4d7b5e
SJIS-WIN □?淋???桀?M}□?淋???桀?M{^ 100000011010000000111111100101111101001000111111001111110011111110011110011110110011111101001101011111011000000110100000001111111001011111010010001111110011111100111111100111100111101100111111010011010111101101011110 81a03f97d23f3f3f9e7b3f4d7d81a03f97d23f3f3f9e7b3f4d7b5e
EUC-JP □?淋??đ桀?M}□?淋??đ桀?M{^ 10100010101000100011111111001110110101000011111100111111100011111010100111000010110110111101110000111111010011010111110110100010101000100011111111001110110101000011111100111111100011111010100111000010110110111101110000111111010011010111101101011110 a2a23fced43f3f8fa9c2dbdc3f4d7da2a23fced43f3f8fa9c2dbdc3f4d7b5e
UTF-8 □룶淋헌룵đ桀택M}□룶淋헌룵đ桀택M{^ 111000101001011010100001111010111010001110110110111001101011011110001011111011011001011110001100111010111010001110110101110001001001000111100110101000011000000011101101100000111001110101001101011111011110001010010110101000011110101110100011101101101110011010110111100010111110110110010111100011001110101110100011101101011100010010010001111001101010000110000000111011011000001110011101010011010111101101011110 e296a1eba3b6e6b78bed978ceba3b5c491e6a180ed839d4d7de296a1eba3b6e6b78bed978ceba3b5c491e6a180ed839d4d7b5e
UHC □룶淋헌룵đ桀택M}□룶淋헌룵đ桀택M{^ 10100001111000001000111110101011110101111111101011000111111001011000111110101010101010011010001011001011111110101100010111000011010011010111110110100001111000001000111110101011110101111111101011000111111001011000111110101010101010011010001011001011111110101100010111000011010011010111101101011110 a1e08fabd7fac7e58faaa9a2cbfac5c34d7da1e08fabd7fac7e58faaa9a2cbfac5c34d7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)