To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN タス竺タス竺[タス竺タス竺[^ 1111000010110011110000001111000110001110101111011000111010110001111100001011001111000000111100011000111010111101100011101011000101011011111100001011001111000000111100011000111010111101100011101011000111110000101100111100000011110001100011101011110110001110101100010101101101011110 f0b3c0f18ebd8eb1f0b3c0f18ebd8eb15bf0b3c0f18ebd8eb1f0b3c0f18ebd8eb15b5e
EUC-JP ?タ?ス竺?タ?ス竺[?タ?ス竺?タ?ス竺[^ 0011111110001110110000000011111110001110101111011011110010110011001111111000111011000000001111111000111010111101101111001011001101011011001111111000111011000000001111111000111010111101101111001011001100111111100011101100000000111111100011101011110110111100101100110101101101011110 3f8ec03f8ebdbcb33f8ec03f8ebdbcb35b3f8ec03f8ebdbcb33f8ec03f8ebdbcb35b5e
UTF-8 タス竺タス竺[タス竺タス竺[^ 111011101000000110110010111011111011111010000000111011101000010010001001111011111011110110111101111001111010101110111010111011101000000110110010111011111011111010000000111011101000010010001001111011111011110110111101111001111010101110111010010110111110111010000001101100101110111110111110100000001110111010000100100010011110111110111101101111011110011110101011101110101110111010000001101100101110111110111110100000001110111010000100100010011110111110111101101111011110011110101011101110100101101101011110 ee81b2efbe80ee8489efbdbde7abbaee81b2efbe80ee8489efbdbde7abba5bee81b2efbe80ee8489efbdbde7abbaee81b2efbe80ee8489efbdbde7abba5b5e
UHC ????竺????竺[????竺????竺[^ 001111110011111100111111001111111111010111100111001111110011111100111111001111111111010111100111010110110011111100111111001111110011111111110101111001110011111100111111001111110011111111110101111001110101101101011110 3f3f3f3ff5e73f3f3f3ff5e75b3f3f3f3ff5e73f3f3f3ff5e75b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)