What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ãù‚Cãù‚C[ãù‚Cãù‚C[^ 111000111111100110000010100011111000111101000011111000111111100110000010100011111000111101000011010110111110001111111001100000101000111110001111010000111110001111111001100000101000111110001111010000110101101101011110 e3f9828f8f43e3f9828f8f435be3f9828f8f43e3f9828f8f435b5e
SJIS-WIN ?????C?????C[?????C?????C[^ 001111110011111100111111001111110011111101000011001111110011111100111111001111110011111101000011010110110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111010000110101101101011110 3f3f3f3f3f433f3f3f3f3f435b3f3f3f3f3f433f3f3f3f3f435b5e
EUC-JP ãù???Cãù???C[ãù???Cãù???C[^ 10001111101010111010101010001111101010111110001100111111001111110011111101000011100011111010101110101010100011111010101111100011001111110011111100111111010000110101101110001111101010111010101010001111101010111110001100111111001111110011111101000011100011111010101110101010100011111010101111100011001111110011111100111111010000110101101101011110 8fabaa8fabe33f3f3f438fabaa8fabe33f3f3f435b8fabaa8fabe33f3f3f438fabaa8fabe33f3f3f435b5e
UTF-8 ãù‚Cãù‚C[ãù‚Cãù‚C[^ 1100001110100011110000111011100111000010100000101100001010001111110000101000111101000011110000111010001111000011101110011100001010000010110000101000111111000010100011110100001101011011110000111010001111000011101110011100001010000010110000101000111111000010100011110100001111000011101000111100001110111001110000101000001011000010100011111100001010001111010000110101101101011110 c3a3c3b9c282c28fc28f43c3a3c3b9c282c28fc28f435bc3a3c3b9c282c28fc28f43c3a3c3b9c282c28fc28f435b5e
UHC ?????C?????C[?????C?????C[^ 001111110011111100111111001111110011111101000011001111110011111100111111001111110011111101000011010110110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111010000110101101101011110 3f3f3f3f3f433f3f3f3f3f435b3f3f3f3f3f433f3f3f3f3f435b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
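
A conversion like the one in the table can be sketched in Python. This is only an illustration, not the converter's actual implementation. The mapping from the table's charset names to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table. For some characters, Python's mapping tables may produce different bytes than the converter does.

```python
# Assumed mapping from the table's charset names to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("C[^").items():
        print(name, bits, hex_string)
```

For plain ASCII input such as `C[^`, all five charsets give the same bytes (`435b5e`), which is why every row in the table ends with the same bit pattern.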