To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????Zg??????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010110100110011100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5a673f3f3f3f3f3f3f
SJIS-WIN テサテ崚」テ、ツ、テ個渉サテサZgツ、テ個ウツ」 11000011101110111100001110011011110000111010001111000011101001001100001010100100110000111000110011000010100011111100001010111011110000111011101101011010011001111100001010100100110000111000110011000010101100111100001010100011 c3bbc39bc3a3c3a4c2a4c38cc28fc2bbc3bb5a67c2a4c38cc2b3c2a3
EUC-JP テサテ崚」テ、ツ、テ個渉サテサZgツ、テ個ウツ」 10001110110000111000111010111011100011101100001111010110110001011000111010100011100011101100001110001110101001001000111011000010100011101010010010001110110000111011100011000100101111101100010010001110101110111000111011000011100011101011101101011010011001111000111011000010100011101010010010001110110000111011100011000100100011101011001110001110110000101000111010100011 8ec38ebb8ec3d6c58ea38ec38ea48ec28ea48ec3b8c4bec48ebb8ec38ebb5a678ec28ea48ec3b8c48eb38ec28ea3
UTF-8 テサテ崚」テ、ツ、テ個渉サテサZgツ、テ個ウツ」 1110111110111110100000111110111110111101101110111110111110111110100000111110010110110100100110101110111110111101101000111110111110111110100000111110111110111101101001001110111110111110100000101110111110111101101001001110111110111110100000111110010110000000100010111110011010111000100010011110111110111101101110111110111110111110100000111110111110111101101110110101101001100111111011111011111010000010111011111011110110100100111011111011111010000011111001011000000010001011111011111011110110110011111011111011111010000010111011111011110110100011 efbe83efbdbbefbe83e5b49aefbda3efbe83efbda4efbe82efbda4efbe83e5808be6b889efbdbbefbe83efbdbb5a67efbe82efbda4efbe83e5808befbdb3efbe82efbda3
UHC ??????????個????Zg???個??? 0011111100111111001111110011111100111111001111110011111100111111001111110011111111001011110000010011111100111111001111110011111101011010011001110011111100111111001111111100101111000001001111110011111100111111 3f3f3f3f3f3f3f3f3f3fcbc13f3f3f3f5a673f3f3fcbc13f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)