Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????J 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4a
SJIS-WIN テサテ敖「テ・ツヲツャツ渉ケツキツ篠ケテサJ 11000011101110111100001110011101110000101010001011000011101001011100001010100110110000101010110011000010100011111100001010111001110000101011011111000010100011101100001010111001110000111011101101001010 c3bbc39dc2a2c3a5c2a6c2acc28fc2b9c2b7c28ec2b9c3bb4a
EUC-JP テサテ敖「テ・ツヲツャツ渉ケツキツ篠ケテサJ 10001110110000111000111010111011100011101100001111011010110001001000111010100010100011101100001110001110101001011000111011000010100011101010011010001110110000101000111010101100100011101100001010111110110001001000111010111001100011101100001010001110101101111000111011000010101111001100010010001110101110011000111011000011100011101011101101001010 8ec38ebb8ec3dac48ea28ec38ea58ec28ea68ec28eac8ec2bec48eb98ec28eb78ec2bcc48eb98ec38ebb4a
UTF-8 テサテ敖「テ・ツヲツャツ渉ケツキツ篠ケテサJ 11101111101111101000001111101111101111011011101111101111101111101000001111100110100101011001011011101111101111011010001011101111101111101000001111101111101111011010010111101111101111101000001011101111101111011010011011101111101111101000001011101111101111011010110011101111101111101000001011100110101110001000100111101111101111011011100111101111101111101000001011101111101111011011011111101111101111101000001011100111101011111010000011101111101111011011100111101111101111101000001111101111101111011011101101001010 efbe83efbdbbefbe83e69596efbda2efbe83efbda5efbe82efbda6efbe82efbdacefbe82e6b889efbdb9efbe82efbdb7efbe82e7afa0efbdb9efbe83efbdbb4a
UHC ???敖?????????????篠???J 001111110011111100111111111001111111100100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111111000011100011000111111001111110011111101001010 3f3f3fe7f93f3f3f3f3f3f3f3f3f3f3f3f3fe1c63f3f3f4a
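
A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` is hypothetical, and it assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 and UHC rows suggest.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("J"):
        print(label, bits, hexstr)
```

For the input "J", every charset in this sketch yields the single byte 4a (01001010), which matches the final byte of each row above.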

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).