To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????xi????????xiB 001111110011111100111111001111110011111100111111001111110011111101111000011010010011111100111111001111110011111100111111001111110011111100111111011110000110100101000010 3f3f3f3f3f3f3f3f78693f3f3f3f3f3f3f3f786942
SJIS-WIN シト縞芝屡芝シ・xiシト縞芝屡芝シ・xiB 1011110011000100100011101100100010001110110001011000111011000110100011101100010110111100101001010111100001101001101111001100010010001110110010001000111011000101100011101100011010001110110001011011110010100101011110000110100101000010 bcc48ec88ec58ec68ec5bca57869bcc48ec88ec58ec68ec5bca5786942
EUC-JP シト縞芝屡芝シ・xiシト縞芝屡芝シ・xiB 10001110101111001000111011000100101111001100101010111100110001111011110011001000101111001100011110001110101111001000111010100101011110000110100110001110101111001000111011000100101111001100101010111100110001111011110011001000101111001100011110001110101111001000111010100101011110000110100101000010 8ebc8ec4bccabcc7bcc8bcc78ebc8ea578698ebc8ec4bccabcc7bcc8bcc78ebc8ea5786942
UTF-8 シト縞芝屡芝シ・xiシト縞芝屡芝シ・xiB 1110111110111101101111001110111110111110100001001110011110111000100111101110100010001010100111011110010110110001101000011110100010001010100111011110111110111101101111001110111110111101101001010111100001101001111011111011110110111100111011111011111010000100111001111011100010011110111010001000101010011101111001011011000110100001111010001000101010011101111011111011110110111100111011111011110110100101011110000110100101000010 efbdbcefbe84e7b89ee88a9de5b1a1e88a9defbdbcefbda57869efbdbcefbe84e7b89ee88a9de5b1a1e88a9defbdbcefbda5786942
UHC ??縞芝?芝??xi??縞芝?芝??xiB 001111110011111111111011110101101111001010111001001111111111001010111001001111110011111101111000011010010011111100111111111110111101011011110010101110010011111111110010101110010011111100111111011110000110100101000010 3f3ffbd6f2b93ff2b93f3f78693f3ffbd6f2b93ff2b93f3f786942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)