To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????\ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ???厭??語⑨ⅲ???語①?松???ょ?擬??\ 00111111001111110011111110001001011111010011111100111111100011001110101010000111010010001111101001000010001111110011111100111111100011001110101010000111010000000011111110001111101111000011111100111111001111111000001011100101001111111000101101011011001111110011111101011100 3f3f3f897d3f3f8cea8748fa423f3f3f8cea87403f8fbc3f3f3f82e53f8b5b3f3f5c
EUC-JP 轝??厭??語??轝??語??松???ょ?擬??\ 1000111111100001101010100011111100111111101100011101111000111111001111111011100011101100001111110011111110001111111000011010101000111111001111111011100011101100001111110011111110111110101111100011111100111111001111111010010011100111001111111011010110111100001111110011111101011100 8fe1aa3f3fb1de3f3fb8ec3f3f8fe1aa3f3fb8ec3f3fbebe3f3f3fa4e73fb5bc3f3f5c
UTF-8 轝뚮젶厭묐젒語⑨ⅲ轝뚮젶語①꺗松㎪쵊溜ょ뼇擬묐콐\ 11101000101111011001110111101011100110101010111011101100101000001011011011100101100011101010110111101011101011001001000011101100101000001001001011101000101010101001111011100010100100011010100011100010100001011011001011101000101111011001110111101011100110101010111011101100101000001011011011101000101010101001111011100010100100011010000011101010101110101001011111100110100111011011111011100011100011101010101011101100101101011000101011101111101001111000101111100011100000101000011111101011101111001000011111100110100100111010110011101011101011001001000011101100101111011001000001011100 e8bd9deb9aaeeca0b6e58eadebac90eca092e8aa9ee291a8e285b2e8bd9deb9aaeeca0b6e8aa9ee291a0eaba97e69dbee38eaaecb58aefa78be38287ebbc87e693acebac90ecbd905c
UHC 轝뚮젶厭묐젒語⑨ⅲ轝뚮젶語①꺗松㎪쵊溜ょ뼇擬묐콐\ 11100110101011001000110011101011101000001010101011100110111101001001000111101011101000001001000111100101110111101010100011101111101001011010001111100110101011001000110011101011101000001010101011100101110111101010100011100111100000111011110111100001111001101010011111100110101011001000110011101010111111101010101011100111100101101001000111101011111101001001000111101011101100011000110001011100 e6ac8ceba0aae6f491eba091e5dea8efa5a3e6ac8ceba0aae5dea8e783bde1e6a7e6ac8ceafeaae79691ebf491ebb18c5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)