To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B??????????B^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001011110 3f3f3f3f3f3f3f3f3f3f423f3f3f3f3f3f3f3f3f3f425e
SJIS-WIN 症マシッ柴叱謝嫉B症マシッ柴叱謝嫉B^ 10001111110001111100111110111100101011111111001011101111100011101100010011110001111011101000111010110110100011101101001110001110101110010100001010001111110001111100111110111100101011111111001011101111100011101100010011110001111011101000111010110110100011101101001110001110101110010100001001011110 8fc7cfbcaff2ef8ec4f1ee8eb68ed38eb9428fc7cfbcaff2ef8ec4f1ee8eb68ed38eb9425e
EUC-JP 症マシッ?柴?叱謝嫉B症マシッ?柴?叱謝嫉B^ 101111101100100110001110110011111000111010111100100011101010111100111111101111001100011000111111101111001011100010111100110101011011110010111011010000101011111011001001100011101100111110001110101111001000111010101111001111111011110011000110001111111011110010111000101111001101010110111100101110110100001001011110 bec98ecf8ebc8eaf3fbcc63fbcb8bcd5bcbb42bec98ecf8ebc8eaf3fbcc63fbcb8bcd5bcbb425e
UTF-8 症マシッ柴叱謝嫉B症マシッ柴叱謝嫉B^ 111001111001011110000111111011111011111010001111111011111011110110111100111011111011110110101111111011101000100010100110111001101001111110110100111011101000010110101001111001011000111110110001111010001010110010011101111001011010101110001001010000101110011110010111100001111110111110111110100011111110111110111101101111001110111110111101101011111110111010001000101001101110011010011111101101001110111010000101101010011110010110001111101100011110100010101100100111011110010110101011100010010100001001011110 e79787efbe8fefbdbcefbdafee88a6e69fb4ee85a9e58fb1e8ac9de5ab8942e79787efbe8fefbdbcefbdafee88a6e69fb4ee85a9e58fb1e8ac9de5ab89425e
UHC 症????柴?叱謝嫉B症????柴?叱謝嫉B^ 111100011111100000111111001111110011111100111111111000111100001100111111111100101110101011011110111100111111001011101100010000101111000111111000001111110011111100111111001111111110001111000011001111111111001011101010110111101111001111110010111011000100001001011110 f1f83f3f3f3fe3c33ff2eadef3f2ec42f1f83f3f3f3fe3c33ff2eadef3f2ec425e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)