What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset      Character   Bit string (binary)                Bit string (hexadecimal)
ISO-8859-1   ?I          0011111101001001                   3f49
SJIS-WIN     崟I         100110111011111001001001           9bbe49
EUC-JP       崟I         110101101100000001001001           d6c049
UTF-8        崟I         11100101101101001001111101001001   e5b49f49
UHC          ?I          0011111101001001                   3f49

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
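
The conversion shown in the table can be approximated with a minimal Python sketch. It is not this page's actual implementation, which is not known. The function name `encode_bits` is hypothetical, and the Python codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to correspond to the charsets listed above. Characters a charset cannot represent are replaced with "?", which matches the "?I" rows.

```python
def encode_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset.

    Hypothetical helper for illustration only. Characters that the
    charset cannot represent are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(charset, errors="replace")
    # Each byte becomes exactly 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Assumed mapping from the page's charset labels to Python codec names.
    charsets = {
        "ISO-8859-1": "iso-8859-1",
        "SJIS-WIN": "cp932",
        "EUC-JP": "euc_jp",
        "UTF-8": "utf-8",
        "UHC": "cp949",
    }
    for label, codec in charsets.items():
        binary, hexadecimal = encode_bits("崟I", codec)
        print(label, binary, hexadecimal)
```

The binary column is just the hexadecimal column written out at 8 bits per byte, so the two always describe the same byte sequence.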