To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セャ実セォ治治釈セャ紗樟冐タセォ治治爵 1011111010101100100011101100000010111110101010111000111010100001100011101010000110001110110111111011111010101100100011101101000110001111101111101110001111101100110000001011111010101011100011101010000110001110101000011000111011011101 beac8ec0beab8ea18ea18edfbeac8ed18fbee3ecc0beab8ea18ea18edd
EUC-JP セャ実セォ治治釈セャ紗樟冐タセォ治治爵 1000111010111110100011101010110010111100110000101000111010111110100011101010101110111100101000111011110010100011101111001110000110001110101111101000111010101100101111001101001110111110110000001110011011101110100011101100000010001110101111101000111010101011101111001010001110111100101000111011110011011111 8ebe8eacbcc28ebe8eabbca3bca3bce18ebe8eacbcd3bec0e6ee8ec08ebe8eabbca3bca3bcdf
UTF-8 セャ実セォ治治釈セャ紗樟冐タセォ治治爵 111011111011110110111110111011111011110110101100111001011010111010011111111011111011110110111110111011111011110110101011111001101011001010111011111001101011001010111011111010011000011110001000111011111011110110111110111011111011110110101100111001111011010010010111111001101010100010011111111001011000011010010000111011111011111010000000111011111011110110111110111011111011110110101011111001101011001010111011111001101011001010111011111001111000100010110101 efbdbeefbdace5ae9fefbdbeefbdabe6b2bbe6b2bbe98788efbdbeefbdace7b497e6a89fe58690efbe80efbdbeefbdabe6b2bbe6b2bbe788b5
UHC ?????治治???紗樟????治治爵 0011111100111111001111110011111100111111111101101011110111110110101111010011111100111111001111111101111011101001111011011110100100111111001111110011111100111111111101101011110111110110101111011110110111001001 3f3f3f3f3ff6bdf6bd3f3f3fdee9ede93f3f3f3ff6bdf6bdedc9

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
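
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the names `to_bit_strings`, `convert`, and `CHARSETS` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the ISO-8859-1 and UHC rows, but Python's codecs may differ from this page's converter in edge cases.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("治").items():
        print(label, binary, hexadecimal)
```

Running the script prints one line per charset in the same column order as the table, minus the character column.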