Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset     Character  Bit string (binary)               Bit string (hexadecimal)
ISO-8859-1  ?G         0011111101000111                  3f47
SJIS-WIN    茉G        111001001001110101000111          e49d47
EUC-JP      茉G        111001111111110101000111          e7fd47
UTF-8       茉G        11101000100011001000100101000111  e88c8947
UHC         茉G        110110001100100101000111          d8c947

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
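
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name encode_row and the mapping from this page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions, and Python's codec tables may differ from this tool's in edge cases. Characters that a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row.

```python
# Assumed mapping from this page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed characters, binary bit string, hex string) for text."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decode again to show what actually survived the round trip.
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexstr = encode_row("茉G", codec)
        print(label, shown, bits, hexstr)
```

Each byte is rendered as eight binary digits, so the length of the bit string is always eight times the number of encoded bytes.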