To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN テ・ツサツエテ・ツ陳・テゥツδ撮 11000011101001011100001010111011110000101011010011000011101001011100001010010010110000101010010111000011101010011100001010000011110000101000111001000010 c3a5c2bbc2b4c3a5c292c2a5c3a9c283c28e42
EUC-JP テ・ツサツエテ・ツ陳・テゥツδ撮 1000111011000011100011101010010110001110110000101000111010111011100011101100001010001110101101001000111011000011100011101010010110001110110000101100010011000100100011101010010110001110110000111000111010101001100011101100001010100110110001001011101110100011 8ec38ea58ec28ebb8ec28eb48ec38ea58ec2c4c48ea58ec38ea98ec2a6c4bba3
UTF-8 テ・ツサツエテ・ツ陳・テゥツδ撮 1110111110111110100000111110111110111101101001011110111110111110100000101110111110111101101110111110111110111110100000101110111110111101101101001110111110111110100000111110111110111101101001011110111110111110100000101110100110011001101100111110111110111101101001011110111110111110100000111110111110111101101010011110111110111110100000101100111010110100111001101001001010101110 efbe83efbda5efbe82efbdbbefbe82efbdb4efbe83efbda5efbe82e999b3efbda5efbe83efbda9efbe82ceb4e692ae
UHC ?????????陳????δ撮 00111111001111110011111100111111001111110011111100111111001111110011111111110010111001110011111100111111001111110011111110100101111001001111010111001001 3f3f3f3f3f3f3f3f3ff2e73f3f3f3fa5e4f5c9

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
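
The kind of conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not this page's actual implementation: the function name encode_table is hypothetical, and the Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumed to be close equivalents of the charsets listed. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Sketch: encode a string in several charsets and show the resulting
# bytes as a binary bit string and as hexadecimal.
# Assumption: the Python codecs below approximate the page's charsets.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unrepresentable characters
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexed in encode_table("あ"):
        print(name, bits, hexed)
```

For the single character "あ" (U+3042), this yields 3f for ISO-8859-1 (not representable, so replaced), 82a0 for CP932, a4a2 for EUC-JP, and e38182 for UTF-8.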