What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 簇?鬱賂??雋 1110001011000110001111111001111101010100100110000100011100111111001111111110100010110010 e2c63f9f5498473f3fe8b2
EUC-JP 簇?鬱賂??雋 1110010011001000001111111101110110110101110011111010100000111111001111111111000010110100 e4c83fddb5cfa83f3ff0b4
UTF-8 簇렫鬱賂렰렣雋 111001111011000010000111111010111010000010101011111010011010110010110001111010001011001110000010111010111010000010110000111010111010000010100011111010011001101110001011 e7b087eba0abe9acb1e8b382eba0b0eba0a3e99b8b
UHC 簇렫鬱賂렰렣雋 1111000011101010100011101011100111101010101001101101011011110001100011101011110110001110101101001111000111100110 f0ea8eb9eaa6d6f18ebd8eb4f1e6
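
The conversion shown in the table can be reproduced offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row above. Individual byte values may differ slightly from the tool's for characters where the converters' mapping tables disagree.

```python
# Map the labels used in the table to Python codec names.
# Assumption: the original tool may use a different converter with
# slightly different mapping tables for some characters.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text with every codec in CODECS, keyed by table label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For example, `encode_row("あ", "utf-8")` yields the hex string `e38182`, while the same character in `iso-8859-1` falls back to `3f` because Latin-1 has no hiragana.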

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).