To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ïø´å£çåå³Þïø´å£çåå³Ü^ 111011111111100010110100111001011010001111100111111001011110010110110011110111101110111111111000101101001110010110100011111001111110010111100101101100111101110001011110 eff8b4e5a3e7e5e5b3deeff8b4e5a3e7e5e5b3dc5e
SJIS-WIN ??´?£???????´?£?????^ 00111111001111111000000101001100001111111000000110010010001111110011111100111111001111110011111100111111001111111000000101001100001111111000000110010010001111110011111100111111001111110011111101011110 3f3f814c3f81923f3f3f3f3f3f3f814c3f81923f3f3f3f3f5e
EUC-JP ïø´å£çåå?Þïø´å£çåå?Ü^ 1000111110101011110000011000111110101001110011001010000110101101100011111010101110101001101000011111001010001111101010111010111010001111101010111010100110001111101010111010100100111111100011111010100110110000100011111010101111000001100011111010100111001100101000011010110110001111101010111010100110100001111100101000111110101011101011101000111110101011101010011000111110101011101010010011111110001111101010101110010001011110 8fabc18fa9cca1ad8faba9a1f28fabae8faba98faba93f8fa9b08fabc18fa9cca1ad8faba9a1f28fabae8faba98faba93f8faae45e
UTF-8 ïø´å£çåå³Þïø´å£çåå³Ü^ 1100001110101111110000111011100011000010101101001100001110100101110000101010001111000011101001111100001110100101110000111010010111000010101100111100001110011110110000111010111111000011101110001100001010110100110000111010010111000010101000111100001110100111110000111010010111000011101001011100001010110011110000111001110001011110 c3afc3b8c2b4c3a5c2a3c3a7c3a5c3a5c2b3c39ec3afc3b8c2b4c3a5c2a3c3a7c3a5c3a5c2b3c39c5e
UHC ?ø´?????³Þ?ø´?????³?^ 00111111101010011010101010100010101001010011111100111111001111110011111100111111101010011111100010101000101011010011111110101001101010101010001010100101001111110011111100111111001111110011111110101001111110000011111101011110 3fa9aaa2a53f3f3f3f3fa9f8a8ad3fa9aaa2a53f3f3f3f3fa9f83f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)