Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 閾鷺ゥク繖キ謔 1110100010000111100011011110101110101001101110001110001110000100101101111110011010000010 e8878deba9b8e384b7e682
EUC-JP 閾鷺ゥク繖キ謔 1110111111100111101110101110110110001110101010011000111010111000111001011110010010001110101101111110101111100010 efe7baed8ea98eb8e5e48eb7ebe2
UTF-8 閾鷺ゥク繖キ謔 111010011001011010111110111010011011011110111010111011111011110110101001111011111011110110111000111001111011100110010110111011111011110110110111111010001010110010010100 e996bee9b7baefbda9efbdb8e7b996efbdb7e8ac94
UHC ?鷺????謔 001111111101011011011100001111110011111100111111001111111111100111001100 3fd6dc3f3f3f3ff9cc

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.