To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 寥?低短寥?低短^ 100110111000110000111111100100101110000110010010010110101001101110001100001111111001001011100001100100100101101001011110 9b8c3f92e1925a9b8c3f92e1925a5e
EUC-JP 寥?低短寥?低短^ 110101011110110000111111110001001110001111000011101110111101010111101100001111111100010011100011110000111011101101011110 d5ec3fc4e3c3bbd5ec3fc4e3c3bb5e
UTF-8 寥렔低短寥렔低短^ 11100101101011111010010111101011101000001001010011100100101111011000111011100111100111111010110111100101101011111010010111101011101000001001010011100100101111011000111011100111100111111010110101011110 e5afa5eba094e4bd8ee79fade5afa5eba094e4bd8ee79fad5e
UHC 寥렔低短寥렔低短^ 1110100011101111100011101010100111101110101110001101001110101101111010001110111110001110101010011110111010111000110100111010110101011110 e8ef8ea9eeb8d3ade8ef8ea9eeb8d3ad5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)