To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??S??????U}??S??????U{^ 0011111100111111010100110011111100111111001111110011111100111111001111110101010101111101001111110011111101010011001111110011111100111111001111110011111100111111010101010111101101011110 3f3f533f3f3f3f3f3f557d3f3f533f3f3f3f3f3f557b5e
SJIS-WIN テサSツ妥ョツつャU}テサSツ妥ョツつャU{^ 110000111011101101010011110000101001000111000011101011101100001010000010110000101010110001010101011111011100001110111011010100111100001010010001110000111010111011000010100000101100001010101100010101010111101101011110 c3bb53c291c3aec282c2ac557dc3bb53c291c3aec282c2ac557b5e
EUC-JP テサSツ妥ョツつャU}テサSツ妥ョツつャU{^ 100011101100001110001110101110110101001110001110110000101100001011000101100011101010111010001110110000101010010011000100100011101010110001010101011111011000111011000011100011101011101101010011100011101100001011000010110001011000111010101110100011101100001010100100110001001000111010101100010101010111101101011110 8ec38ebb538ec2c2c58eae8ec2a4c48eac557d8ec38ebb538ec2c2c58eae8ec2a4c48eac557b5e
UTF-8 テサSツ妥ョツつャU}テサSツ妥ョツつャU{^ 11101111101111101000001111101111101111011011101101010011111011111011111010000010111001011010011010100101111011111011110110101110111011111011111010000010111000111000000110100100111011111011110110101100010101010111110111101111101111101000001111101111101111011011101101010011111011111011111010000010111001011010011010100101111011111011110110101110111011111011111010000010111000111000000110100100111011111011110110101100010101010111101101011110 efbe83efbdbb53efbe82e5a6a5efbdaeefbe82e381a4efbdac557defbe83efbdbb53efbe82e5a6a5efbdaeefbe82e381a4efbdac557b5e
UHC ??S?妥??つ?U}??S?妥??つ?U{^ 001111110011111101010011001111111111011011100110001111110011111110101010110001000011111101010101011111010011111100111111010100110011111111110110111001100011111100111111101010101100010000111111010101010111101101011110 3f3f533ff6e63f3faac43f557d3f3f533ff6e63f3faac43f557b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
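
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own implementation, which is not shown on this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents of the labels above, and `errors="replace"` is an assumption chosen because it substitutes `?` (hex 3f) for unencodable characters, similar to the 3f bytes in the ISO-8859-1 and UHC rows. The names `CODECS`, `to_bits`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bits(data: bytes) -> str:
    """Render each byte as 8 binary digits, concatenated without separators."""
    return "".join(f"{b:08b}" for b in data)


def encode_table(text: str) -> dict:
    """Encode text in every charset; return {label: (binary, hexadecimal)}."""
    table = {}
    for label, codec in CODECS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        data = text.encode(codec, errors="replace")
        table[label] = (to_bits(data), data.hex())
    return table


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("つ").items():
        print(label, bits, hexa)
```

For the single character つ, this sketch yields e381a4 in UTF-8 and a4c4 in EUC-JP, the same byte runs that appear inside the longer hexadecimal strings in the table.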