Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN セャ紗セ、篠セォ偲セュ宍セャ紗セ、篠セォ偲セュ宍B 101111101010110010001110110100011011111010100100100011101100001010111110101010111000111011000011101111101010110110001110101100111011111010101100100011101101000110111110101001001000111011000010101111101010101110001110110000111011111010101101100011101011001101000010 beac8ed1bea48ec2beab8ec3bead8eb3beac8ed1bea48ec2beab8ec3bead8eb342
EUC-JP セャ紗セ、篠セォ偲セュ宍セャ紗セ、篠セォ偲セュ宍B 10001110101111101000111010101100101111001101001110001110101111101000111010100100101111001100010010001110101111101000111010101011101111001100010110001110101111101000111010101101101111001011010110001110101111101000111010101100101111001101001110001110101111101000111010100100101111001100010010001110101111101000111010101011101111001100010110001110101111101000111010101101101111001011010101000010 8ebe8eacbcd38ebe8ea4bcc48ebe8eabbcc58ebe8eadbcb58ebe8eacbcd38ebe8ea4bcc48ebe8eabbcc58ebe8eadbcb542
UTF-8 セャ紗セ、篠セォ偲セュ宍セャ紗セ、篠セォ偲セュ宍B 11101111101111011011111011101111101111011010110011100111101101001001011111101111101111011011111011101111101111011010010011100111101011111010000011101111101111011011111011101111101111011010101111100101100000011011001011101111101111011011111011101111101111011010110111100101101011101000110111101111101111011011111011101111101111011010110011100111101101001001011111101111101111011011111011101111101111011010010011100111101011111010000011101111101111011011111011101111101111011010101111100101100000011011001011101111101111011011111011101111101111011010110111100101101011101000110101000010 efbdbeefbdace7b497efbdbeefbda4e7afa0efbdbeefbdabe581b2efbdbeefbdade5ae8defbdbeefbdace7b497efbdbeefbda4e7afa0efbdbeefbdabe581b2efbdbeefbdade5ae8d42
UHC ??紗??篠????????紗??篠??????B 0011111100111111110111101110100100111111001111111110000111000110001111110011111100111111001111110011111100111111001111110011111111011110111010010011111100111111111000011100011000111111001111110011111100111111001111110011111101000010 3f3fdee93f3fe1c63f3f3f3f3f3f3f3fdee93f3fe1c63f3f3f3f3f3f42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
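
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The codec names are assumptions about how the labels map to Python's standard codecs (SJIS-WIN as `cp932`, UHC as `cp949`), and `encode_row` and `encode_table` are hypothetical helper names. Characters a charset cannot represent are replaced with `?` (0x3f), which appears to be what the ISO-8859-1 and UHC rows do.

```python
# Table labels mapped to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset; keys are the table's charset labels."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # U+7D17 is one of the kanji from the table, followed by ASCII "B".
    for label, (bits, hex_string) in encode_table("\u7d17B").items():
        print(label, bits, hex_string)
```

ASCII characters such as `B` encode to the single byte 0x42 in all five charsets, which is why every row in the table ends in `42`.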