To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セャ質爾捨セュ痔治識治釈セャ 10111110101011001000111010111111100011101010001010001110110011001011111010101101100011101010010010001110101000011000111010101111100011101010000110001110110111111011111010101100 beac8ebf8ea28eccbead8ea48ea18eaf8ea18edfbeac
EUC-JP セャ質爾捨セュ痔治識治釈セャ 10001110101111101000111010101100101111001100000110111100101001001011110011001110100011101011111010001110101011011011110010100110101111001010001110111100101100011011110010100011101111001110000110001110101111101000111010101100 8ebe8eacbcc1bca4bcce8ebe8eadbca6bca3bcb1bca3bce18ebe8eac
UTF-8 セャ質爾捨セュ痔治識治釈セャ 111011111011110110111110111011111011110110101100111010001011001110101010111001111000100010111110111001101000110110101000111011111011110110111110111011111011110110101101111001111001011110010100111001101011001010111011111010001010110110011000111001101011001010111011111010011000011110001000111011111011110110111110111011111011110110101100 efbdbeefbdace8b3aae788bee68da8efbdbeefbdade79794e6b2bbe8ad98e6b2bbe98788efbdbeefbdac
UHC ??質爾捨??痔治識治??? 001111110011111111110010111101011110110010110011110111101101011100111111001111111111011011000000111101101011110111100011110110111111011010111101001111110011111100111111 3f3ff2f5ecb3ded73f3ff6c0f6bde3dbf6bd3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)