To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??D?????????D???????B 001111110011111101000100001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111101000010 3f3f443f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f42
SJIS-WIN テ、Dテ「テ禿」ツQテ、Dテ「テ禿」ツQB 11000011101001000100010011000011101000101100001110010011110000111010001111000010100000100111000011000011101001000100010011000011101000101100001110010011110000111010001111000010100000100111000001000010 c3a444c3a2c393c3a3c28270c3a444c3a2c393c3a3c2827042
EUC-JP テ、Dテ「テ禿」ツQテ、Dテ「テ禿」ツQB 100011101100001110001110101001000100010010001110110000111000111010100010100011101100001111000110110001011000111010100011100011101100001010100011110100011000111011000011100011101010010001000100100011101100001110001110101000101000111011000011110001101100010110001110101000111000111011000010101000111101000101000010 8ec38ea4448ec38ea28ec3c6c58ea38ec2a3d18ec38ea4448ec38ea28ec3c6c58ea38ec2a3d142
UTF-8 テ、Dテ「テ禿」ツQテ、Dテ「テ禿」ツQB 111011111011111010000011111011111011110110100100010001001110111110111110100000111110111110111101101000101110111110111110100000111110011110100110101111111110111110111101101000111110111110111110100000101110111110111100101100011110111110111110100000111110111110111101101001000100010011101111101111101000001111101111101111011010001011101111101111101000001111100111101001101011111111101111101111011010001111101111101111101000001011101111101111001011000101000010 efbe83efbda444efbe83efbda2efbe83e7a6bfefbda3efbe82efbcb1efbe83efbda444efbe83efbda2efbe83e7a6bfefbda3efbe82efbcb142
UHC ??D???禿??Q??D???禿??QB 00111111001111110100010000111111001111110011111111010100101111100011111100111111101000111101000100111111001111110100010000111111001111110011111111010100101111100011111100111111101000111101000101000010 3f3f443f3f3fd4be3f3fa3d13f3f443f3f3fd4be3f3fa3d142

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.