What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 仲?鬱賂??雋 1001001010000111001111111001111101010100100110000100011100111111001111111110100010110010 92873f9f5498473f3fe8b2
EUC-JP 仲?鬱賂??雋 1100001111100111001111111101110110110101110011111010100000111111001111111111000010110100 c3e73fddb5cfa83f3ff0b4
UTF-8 仲렫鬱賂렰렣雋 111001001011101110110010111010111010000010101011111010011010110010110001111010001011001110000010111010111010000010110000111010111010000010100011111010011001101110001011 e4bbb2eba0abe9acb1e8b382eba0b0eba0a3e99b8b
UHC 仲렫鬱賂렰렣雋 1111000111101010100011101011100111101010101001101101011011110001100011101011110110001110101101001111000111100110 f1ea8eb9eaa6d6f18ebd8eb4f1e6
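
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the helper name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the rows above.

```python
def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset.

    Characters the charset cannot represent are replaced with '?',
    via errors="replace".
    """
    raw = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "仲렫鬱賂렰렣雋"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(label, bits, hex_string)
```

Other environments may map a few characters differently from the tool, so the legacy-charset rows are best treated as indicative rather than guaranteed to match byte for byte.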

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).