To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????L????????????????L^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001100001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110001011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4c3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4c5e
SJIS-WIN テ、テ敖渉渉蹴テ、ツェツ湘ュツ酬Lテ、テ敖渉渉蹴テ、ツェツ湘ュツ酬L^ 1100001110100100110000111001110111000010100011111100001010001111110000101000111101010010110000111010010011000010101010101100001010001111110000111010110111000010100011110101011001001100110000111010010011000011100111011100001010001111110000101000111111000010100011110101001011000011101001001100001010101010110000101000111111000011101011011100001010001111010101100100110001011110 c3a4c39dc28fc28fc28f52c3a4c2aac28fc3adc28f564cc3a4c39dc28fc28fc28f52c3a4c2aac28fc3adc28f564c5e
EUC-JP テ、テ敖渉渉蹴テ、ツェツ湘ュツ酬Lテ、テ敖渉渉蹴テ、ツェツ湘ュツ酬L^ 10001110110000111000111010100100100011101100001111011010110001001011111011000100101111101100010010111101101100111000111011000011100011101010010010001110110000101000111010101010100011101100001010111110110001011000111010101101100011101100001010111101101101110100110010001110110000111000111010100100100011101100001111011010110001001011111011000100101111101100010010111101101100111000111011000011100011101010010010001110110000101000111010101010100011101100001010111110110001011000111010101101100011101100001010111101101101110100110001011110 8ec38ea48ec3dac4bec4bec4bdb38ec38ea48ec28eaa8ec2bec58ead8ec2bdb74c8ec38ea48ec3dac4bec4bec4bdb38ec38ea48ec28eaa8ec2bec58ead8ec2bdb74c5e
UTF-8 テ、テ敖渉渉蹴テ、ツェツ湘ュツ酬Lテ、テ敖渉渉蹴テ、ツェツ湘ュツ酬L^ 111011111011111010000011111011111011110110100100111011111011111010000011111001101001010110010110111001101011100010001001111001101011100010001001111010001011100110110100111011111011111010000011111011111011110110100100111011111011111010000010111011111011110110101010111011111011111010000010111001101011100110011000111011111011110110101101111011111011111010000010111010011000010110101100010011001110111110111110100000111110111110111101101001001110111110111110100000111110011010010101100101101110011010111000100010011110011010111000100010011110100010111001101101001110111110111110100000111110111110111101101001001110111110111110100000101110111110111101101010101110111110111110100000101110011010111001100110001110111110111101101011011110111110111110100000101110100110000101101011000100110001011110 efbe83efbda4efbe83e69596e6b889e6b889e8b9b4efbe83efbda4efbe82efbdaaefbe82e6b998efbdadefbe82e985ac4cefbe83efbda4efbe83e69596e6b889e6b889e8b9b4efbe83efbda4efbe82efbdaaefbe82e6b998efbdadefbe82e985ac4c5e
UHC ???敖??蹴?????湘??酬L???敖??蹴?????湘??酬L^ 00111111001111110011111111100111111110010011111100111111111101011110110100111111001111110011111100111111001111111101111111001111001111110011111111100010110001100100110000111111001111110011111111100111111110010011111100111111111101011110110100111111001111110011111100111111001111111101111111001111001111110011111111100010110001100100110001011110 3f3f3fe7f93f3ff5ed3f3f3f3f3fdfcf3f3fe2c64c3f3f3fe7f93f3ff5ed3f3f3f3f3fdfcf3f3fe2c64c5e

SJIS-win, EUC-JP: legacy character sets mainly used to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
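
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the ISO-8859-1 row above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexed in encode_table("あ"):
        print(label, bits, hexed)
```

For the single character あ, this gives `e38182` in UTF-8, `82a0` in SJIS-win, and `a4a2` in EUC-JP, while ISO-8859-1 falls back to `3f` because it has no Japanese characters.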