To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セャ治治爵樵ィ篠セャ萬セャ治治尺?讀篠セ 10111110101011001000111010100001100011101010000110001110110111011000111110111111101010001000111011000010101111101010110011100100110111011011111010101100100011101010000110001110101000011000111011011010001111111110011010100100100011101100001010111110 beac8ea18ea18edd8fbfa88ec2beace4ddbeac8ea18ea18eda3fe6a48ec2be
EUC-JP セャ治治爵樵ィ篠セャ萬セャ治治尺鑃讀篠セ 1000111010111110100011101010110010111100101000111011110010100011101111001101111110111110110000011000111010101000101111001100010010001110101111101000111010101100111010001101111110001110101111101000111010101100101111001010001110111100101000111011110011011100100011111110010111101001111011001010011010111100110001001000111010111110 8ebe8eacbca3bca3bcdfbec18ea8bcc48ebe8eace8df8ebe8eacbca3bca3bcdc8fe5e9eca6bcc48ebe
UTF-8 セャ治治爵樵ィ篠セャ萬セャ治治尺鑃讀篠セ 111011111011110110111110111011111011110110101100111001101011001010111011111001101011001010111011111001111000100010110101111001101010100010110101111011111011110110101000111001111010111110100000111011111011110110111110111011111011110110101100111010001001000010101100111011111011110110111110111011111011110110101100111001101011001010111011111001101011001010111011111001011011000010111010111010011001000110000011111010001010111010000000111001111010111110100000111011111011110110111110 efbdbeefbdace6b2bbe6b2bbe788b5e6a8b5efbda8e7afa0efbdbeefbdace890acefbdbeefbdace6b2bbe6b2bbe5b0bae99183e8ae80e7afa0efbdbe
UHC ??治治爵樵?篠??萬??治治尺?讀篠? 00111111001111111111011010111101111101101011110111101101110010011111010110100011001111111110000111000110001111110011111111011000101111110011111100111111111101101011110111110110101111011111010010101001001111111101010011000001111000011100011000111111 3f3ff6bdf6bdedc9f5a33fe1c63f3fd8bf3f3ff6bdf6bdf4a93fd4c1e1c63f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)