To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????E 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ツ湘、ツソツ偲可偲杠ツ湘、ツソツ偲可偲枌E 11000010100011111100001110100100110000101011111111000010100011101100001110001001110000101000111011000011100111100101100111000010100011111100001110100100110000101011111111000010100011101100001110001001110000101000111011000011100111100110001001000101 c28fc3a4c2bfc28ec389c28ec39e59c28fc3a4c2bfc28ec389c28ec39e6245
EUC-JP ツ湘、ツソツ偲可偲杠ツ湘、ツソツ偲可偲枌E 1000111011000010101111101100010110001110101001001000111011000010100011101011111110001110110000101011110011000101101100101100010010111100110001011101101110111010100011101100001010111110110001011000111010100100100011101100001010001110101111111000111011000010101111001100010110110010110001001011110011000101110110111100001101000101 8ec2bec58ea48ec28ebf8ec2bcc5b2c4bcc5dbba8ec2bec58ea48ec28ebf8ec2bcc5b2c4bcc5dbc345
UTF-8 ツ湘、ツソツ偲可偲杠ツ湘、ツソツ偲可偲枌E 11101111101111101000001011100110101110011001100011101111101111011010010011101111101111101000001011101111101111011011111111101111101111101000001011100101100000011011001011100101100011111010111111100101100000011011001011100110100111011010000011101111101111101000001011100110101110011001100011101111101111011010010011101111101111101000001011101111101111011011111111101111101111101000001011100101100000011011001011100101100011111010111111100101100000011011001011100110100111101000110001000101 efbe82e6b998efbda4efbe82efbdbfefbe82e581b2e58fafe581b2e69da0efbe82e6b998efbda4efbe82efbdbfefbe82e581b2e58fafe581b2e69e8c45
UHC ?湘?????可???湘?????可??E 00111111110111111100111100111111001111110011111100111111001111111100101010100110001111110011111100111111110111111100111100111111001111110011111100111111001111111100101010100110001111110011111101000101 3fdfcf3f3f3f3f3fcaa63f3f3fdfcf3f3f3f3f3fcaa63f3f45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)