To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN タス耳嫉社蒔タス耳嫉社蒔^ 111100001010111011000000111100011000111010111101100011101010100011110000110000011000111010111001100011101101000010001110101010101111000010101110110000001111000110001110101111011000111010101000111100001100000110001110101110011000111011010000100011101010101001011110 f0aec0f18ebd8ea8f0c18eb98ed08eaaf0aec0f18ebd8ea8f0c18eb98ed08eaa5e
EUC-JP ?タ?ス耳?嫉社蒔?タ?ス耳?嫉社蒔^ 00111111100011101100000000111111100011101011110110111100101010100011111110111100101110111011110011010010101111001010110000111111100011101100000000111111100011101011110110111100101010100011111110111100101110111011110011010010101111001010110001011110 3f8ec03f8ebdbcaa3fbcbbbcd2bcac3f8ec03f8ebdbcaa3fbcbbbcd2bcac5e
UTF-8 タス耳嫉社蒔タス耳嫉社蒔^ 11101110100000011010110111101111101111101000000011101110100001001000100111101111101111011011110111101000100000001011001111101110100000101000000011100101101010111000100111100111101001001011111011101000100100101001010011101110100000011010110111101111101111101000000011101110100001001000100111101111101111011011110111101000100000001011001111101110100000101000000011100101101010111000100111100111101001001011111011101000100100101001010001011110 ee81adefbe80ee8489efbdbde880b3ee8280e5ab89e7a4bee89294ee81adefbe80ee8489efbdbde880b3ee8280e5ab89e7a4bee892945e
UHC ????耳?嫉社蒔????耳?嫉社蒔^ 001111110011111100111111001111111110110010111100001111111111001011101100110111101110010011100011110010000011111100111111001111110011111111101100101111000011111111110010111011001101111011100100111000111100100001011110 3f3f3f3fecbc3ff2ecdee4e3c83f3f3f3fecbc3ff2ecdee4e3c85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)