What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???語??????ら?宥?????語①?松 00111111001111110011111110001100111010100011111100111111001111110011111100111111001111111000001011100111001111111001011101000111001111110011111100111111001111110011111110001100111010101000011101000000001111111000111110111100 3f3f3f8cea3f3f3f3f3f3f82e73f97473f3f3f3f3f8cea87403f8fbc
EUC-JP 轝??語??????ら?宥??轝??語??松 10001111111000011010101000111111001111111011100011101100001111110011111100111111001111110011111100111111101001001110100100111111110011011010100000111111001111111000111111100001101010100011111100111111101110001110110000111111001111111011111010111110 8fe1aa3f3fb8ec3f3f3f3f3f3fa4e93fcda83f3f8fe1aa3f3fb8ec3f3fbebe
UTF-8 轝뚮젶語ⓨ낡隸욄쵊溜ら냽宥밸퀡轝뚮젶語①꺗松 111010001011110110011101111010111001101010101110111011001010000010110110111010001010101010011110111000101001001110101000111010111000001010100001111011111010011010111000111011001001101010000100111011001011010110001010111011111010011110001011111000111000001010001001111010111000001110111101111001011010111010100101111010111011000010111000111011011000000010100001111010001011110110011101111010111001101010101110111011001010000010110110111010001010101010011110111000101001000110100000111010101011101010010111111001101001110110111110 e8bd9deb9aaeeca0b6e8aa9ee293a8eb82a1efa6b8ec9a84ecb58aefa78be38289eb83bde5aea5ebb0b8ed80a1e8bd9deb9aaeeca0b6e8aa9ee291a0eaba97e69dbe
UHC 轝뚮젶語ⓨ낡隸욄쵊溜ら냽宥밸퀡轝뚮젶語①꺗松 1110011010101100100011001110101110100000101010101110010111011110101010001110010110110011101100001110011111100110100111101110011010101100100011001110101011111110101010101110100110000110100011011110101011101001101110011110101110110011100101011110011010101100100011001110101110100000101010101110010111011110101010001110011110000011101111011110000111100110 e6ac8ceba0aae5dea8e5b3b0e7e69ee6ac8ceafeaae9868deae9b9ebb395e6ac8ceba0aae5dea8e783bde1e6

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
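
The conversion shown in the table can be approximated with a minimal Python sketch. It is an illustration, not the code behind this page. The function name `encode_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above; individual mappings may differ slightly from the tool's. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These are assumed equivalents, not confirmed to match the tool exactly.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters become '?' (0x3f) because of errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "語"
    for label, codec in CHARSETS.items():
        binary, hexa = encode_bits(sample, codec)
        print(label, binary, hexa)
```

For the single character 語, this yields the same byte sequences that appear for it in the table rows above: `e8aa9e` in UTF-8, `8cea` in SJIS-WIN, `b8ec` in EUC-JP, and `3f` in ISO-8859-1.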