To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セャ蔡蒔治セュ痔爾射樟蒟黑ェ治セュ痔爾射シ 10111110101011001110010011101111100011101010101010001110101000011011111010101101100011101010010010001110101000101000111011001011100011111011111011100100111001011111110001001011101010101000111010100001101111101010110110001110101001001000111010100010100011101100101110111100 beace4ef8eaa8ea1bead8ea48ea28ecb8fbee4e5fc4baa8ea1bead8ea48ea28ecbbc
EUC-JP セャ蔡蒔治セュ痔爾射樟蒟?ェ治セュ痔爾射シ 1000111010111110100011101010110011101000111100011011110010101100101111001010001110001110101111101000111010101101101111001010011010111100101001001011110011001101101111101100000011101000111001110011111110001110101010101011110010100011100011101011111010001110101011011011110010100110101111001010010010111100110011011000111010111100 8ebe8eace8f1bcacbca38ebe8eadbca6bca4bccdbec0e8e73f8eaabca38ebe8eadbca6bca4bccd8ebc
UTF-8 セャ蔡蒔治セュ痔爾射樟蒟黑ェ治セュ痔爾射シ 111011111011110110111110111011111011110110101100111010001001010010100001111010001001001010010100111001101011001010111011111011111011110110111110111011111011110110101101111001111001011110010100111001111000100010111110111001011011000010000100111001101010100010011111111010001001001010011111111010011011101110010001111011111011110110101010111001101011001010111011111011111011110110111110111011111011110110101101111001111001011110010100111001111000100010111110111001011011000010000100111011111011110110111100 efbdbeefbdace894a1e89294e6b2bbefbdbeefbdade79794e788bee5b084e6a89fe8929fe9bb91efbdaae6b2bbefbdbeefbdade79794e788bee5b084efbdbc
UHC ??蔡蒔治??痔爾射樟?黑?治??痔爾射? 001111110011111111110011111110011110001111001000111101101011110100111111001111111111011011000000111011001011001111011110110100101110110111101001001111111111110111011001001111111111011010111101001111110011111111110110110000001110110010110011110111101101001000111111 3f3ff3f9e3c8f6bd3f3ff6c0ecb3ded2ede93ffdd93ff6bd3f3ff6c0ecb3ded23f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)