To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???i???iB 001111110011111100111111011010010011111100111111001111110110100101000010 3f3f3f693f3f3f6942
SJIS-WIN 升??i升??iB 1000111110100001001111110011111101101001100011111010000100111111001111110110100101000010 8fa13f3f698fa13f3f6942
EUC-JP 升??i升??iB 1011111010100011001111110011111101101001101111101010001100111111001111110110100101000010 bea33f3f69bea33f3f6942
UTF-8 升릯꽑i升릯꽑iB 111001011000110110000111111010111010011010101111111010101011110110010001011010011110010110001101100001111110101110100110101011111110101010111101100100010110100101000010 e58d87eba6afeabd9169e58d87eba6afeabd916942
UHC 升릯꽑i升릯꽑iB 111000111010111010010000100011111000010010100000011010011110001110101110100100001000111110000100101000000110100101000010 e3ae908f84a069e3ae908f84a06942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)