To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ëùÕÊëùÕÊB 111010111111100111010101110010101110101111111001110101011100101001000010 ebf9d5caebf9d5ca42
SJIS-WIN ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
EUC-JP ëùÕÊëùÕÊB 10001111101010111011001110001111101010111110001110001111101010101101100010001111101010101011010010001111101010111011001110001111101010111110001110001111101010101101100010001111101010101011010001000010 8fabb38fabe38faad88faab48fabb38fabe38faad88faab442
UTF-8 ëùÕÊëùÕÊB 1100001110101011110000111011100111000011100101011100001110001010110000111010101111000011101110011100001110010101110000111000101001000010 c3abc3b9c395c38ac3abc3b9c395c38a42
UHC ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)