To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 éå·iéå·iB 1000111111101001111001011011011101101001100011111110100111100101101101110110100101000010 8fe9e5b7698fe9e5b76942
SJIS-WIN ????i????iB 0011111100111111001111110011111101101001001111110011111100111111001111110110100101000010 3f3f3f3f693f3f3f3f6942
EUC-JP ?éå?i?éå?iB 00111111100011111010101110110001100011111010101110101001001111110110100100111111100011111010101110110001100011111010101110101001001111110110100101000010 3f8fabb18faba93f693f8fabb18faba93f6942
UTF-8 éå·iéå·iB 11000010100011111100001110101001110000111010010111000010101101110110100111000010100011111100001110101001110000111010010111000010101101110110100101000010 c28fc3a9c3a5c2b769c28fc3a9c3a5c2b76942
UHC ???·i???·iB 00111111001111110011111110100001101001000110100100111111001111110011111110100001101001000110100101000010 3f3f3fa1a4693f3f3fa1a46942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)