To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 æC 1110011001000011 e643
SJIS-WIN ?C 0011111101000011 3f43
EUC-JP æC 10001111101010011100000101000011 8fa9c143
UTF-8 æC 110000111010011001000011 c3a643
UHC æC 101010011010000101000011 a9a143

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)