To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?j 0011111101101010 3f6a
SJIS-WIN 楔j 100111101011011001101010 9eb66a
EUC-JP 楔j 110111001011100001101010 dcb86a
UTF-8 楔j 11100110101001011001010001101010 e6a5946a
UHC 楔j 111000001101101101101010 e0db6a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)