To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN テサテ敕暗ァツ篠、ツ篠ァテサテ敕暗ァテ崚ィ^ 1100001110111011110000111001110111000011100010001100001110100111110000101000111011000010101001001100001010001110110000101010011111000011101110111100001110011101110000111000100011000011101001111100001110011011110000111010100001011110 c3bbc39dc388c3a7c28ec2a4c28ec2a7c3bbc39dc388c3a7c39bc3a85e
EUC-JP テサテ敕暗ァツ篠、ツ篠ァテサテ敕暗ァテ崚ィ^ 10001110110000111000111010111011100011101100001111011010110001011011000011000101100011101010011110001110110000101011110011000100100011101010010010001110110000101011110011000100100011101010011110001110110000111000111010111011100011101100001111011010110001011011000011000101100011101010011110001110110000111101011011000101100011101010100001011110 8ec38ebb8ec3dac5b0c58ea78ec2bcc48ea48ec2bcc48ea78ec38ebb8ec3dac5b0c58ea78ec3d6c58ea85e
UTF-8 テサテ敕暗ァツ篠、ツ篠ァテサテ敕暗ァテ崚ィ^ 11101111101111101000001111101111101111011011101111101111101111101000001111100110100101011001010111100110100110101001011111101111101111011010011111101111101111101000001011100111101011111010000011101111101111011010010011101111101111101000001011100111101011111010000011101111101111011010011111101111101111101000001111101111101111011011101111101111101111101000001111100110100101011001010111100110100110101001011111101111101111011010011111101111101111101000001111100101101101001001101011101111101111011010100001011110 efbe83efbdbbefbe83e69595e69a97efbda7efbe82e7afa0efbda4efbe82e7afa0efbda7efbe83efbdbbefbe83e69595e69a97efbda7efbe83e5b49aefbda85e
UHC ????暗??篠??篠?????暗????^ 0011111100111111001111110011111111100100110111100011111100111111111000011100011000111111001111111110000111000110001111110011111100111111001111110011111111100100110111100011111100111111001111110011111101011110 3f3f3f3fe4de3f3fe1c63f3fe1c63f3f3f3f3fe4de3f3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)