Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset     Character  Bit string (binary)               Bit string (hexadecimal)
ISO-8859-1  æD         1110011001000100                  e644
SJIS-WIN    ?D         0011111101000100                  3f44
EUC-JP      æD         10001111101010011100000101000100  8fa9c144
UTF-8       æD         110000111010011001000100          c3a644
UHC         æD         101010011010000101000100          a9a144

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
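
A conversion like the one in the table can be approximated offline with a short Python sketch. The codec names below (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions about which Python codecs correspond to the charsets this page uses, and the helper name to_bit_strings is hypothetical. Characters a charset cannot represent are replaced with "?", which appears to be what happened to "æ" in the SJIS-WIN row (3f is the code for "?").

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for name, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("æD", codec)
        print(name, binary, hexadecimal)
```

Python's codec tables may not match this page's converter for every character, so results for other inputs can differ from what the page shows.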