To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 滬鼇報滬鼇報^ 10011111111101001110101010000111100101011111000110011111111101001110101010000111100101011111000101011110 9ff4ea8795f19ff4ea8795f15e
EUC-JP 滬鼇報滬鼇報^ 11011110111101101111001111100111110010101111001111011110111101101111001111100111110010101111001101011110 def6f3e7caf3def6f3e7caf35e
UTF-8 滬鼇報滬鼇報^ 11100110101110111010110011101001101111001000011111100101101000001011000111100110101110111010110011101001101111001000011111100101101000001011000101011110 e6bbace9bc87e5a0b1e6bbace9bc87e5a0b15e
UHC ?鼇報?鼇報^ 0011111111101000101010001101110011000011001111111110100010101000110111001100001101011110 3fe8a8dcc33fe8a8dcc35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)