To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???宥??猿??B 001111110011111100111111100101110100011100111111001111111000100110001110001111110011111101000010 3f3f3f97473f3f898e3f3f42
EUC-JP 孼??宥??猿??B 1000111110111010110000110011111100111111110011011010100000111111001111111011000111101110001111110011111101000010 8fbac33f3fcda83f3fb1ee3f3f42
UTF-8 孼뽮퍔宥욄룚猿뗫떈B 11100101101011011011110011101011101111011010111011101101100011011001010011100101101011101010010111101100100110101000010011101011101000111001101011100111100011001011111111101011100101111010101111101011100101101000100001000010 e5adbcebbdaeed8d94e5aea5ec9a84eba39ae78cbfeb97abeb968842
UHC 孼뽮퍔宥욄룚猿뗫떈B 11100101111011011001011011101010101110111000101111101010111010011001111011100110100011111001011011101010101110111000101111101011100010111001111001000010 e5ed96eabb8beae99ee68f96eabb8beb8b9e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)