Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡?淨?鬱豆??垣矜?縡?淨?鬱豆??垣矜?^ 1110001101110001001111111001111111000100001111111001111101010100100100111010010000111111001111111000101001011111111000011110000000111111111000110111000100111111100111111100010000111111100111110101010010010011101001000011111100111111100010100101111111100001111000000011111101011110 e3713f9fc43f9f5493a43f3f8a5fe1e03fe3713f9fc43f9f5493a43f3f8a5fe1e03f5e
EUC-JP 縡?淨?鬱豆??垣矜汶縡?淨?鬱豆??垣矜汶^ 111001011101001000111111110111101100011000111111110111011011010111000110101001100011111100111111101100111100000011100010111000101000111111000110111001011110010111010010001111111101111011000110001111111101110110110101110001101010011000111111001111111011001111000000111000101110001010001111110001101110010101011110 e5d23fdec63fddb5c6a63f3fb3c0e2e28fc6e5e5d23fdec63fddb5c6a63f3fb3c0e2e28fc6e55e
UTF-8 縡렕淨렠鬱豆렩렰垣矜汶縡렕淨렠鬱豆렩렰垣矜汶^ 11100111101110001010000111101011101000001001010111100110101101111010100011101011101000001010000011101001101011001011000111101000101100011000011011101011101000001010100111101011101000001011000011100101100111101010001111100111100111111001110011100110101100011011011011100111101110001010000111101011101000001001010111100110101101111010100011101011101000001010000011101001101011001011000111101000101100011000011011101011101000001010100111101011101000001011000011100101100111101010001111100111100111111001110011100110101100011011011001011110 e7b8a1eba095e6b7a8eba0a0e9acb1e8b186eba0a9eba0b0e59ea3e79f9ce6b1b6e7b8a1eba095e6b7a8eba0a0e9acb1e8b186eba0a9eba0b0e59ea3e79f9ce6b1b65e
UHC 縡렕淨렠鬱豆렩렰垣矜汶縡렕淨렠鬱豆렩렰垣矜汶^ 111011101010110110001110101010101110111111100100100011101011000111101010101001101101010011100111100011101011011110001110101111011110101010101111110100001110100011011010101000011110111010101101100011101010101011101111111001001000111010110001111010101010011011010100111001111000111010110111100011101011110111101010101011111101000011101000110110101010000101011110 eead8eaaefe48eb1eaa6d4e78eb78ebdeaafd0e8daa1eead8eaaefe48eb1eaa6d4e78eb78ebdeaafd0e8daa15e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
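
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name to_bit_strings and the mapping from the table's charset labels to Python codec names (for example cp932 for SJIS-WIN and cp949 for UHC) are assumptions, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the table.

```python
# Table label -> Python codec name. This mapping is an assumption of the
# sketch; the page's own converter may use slightly different tables.
CODECS = {
    "ISO-8859-1": "latin_1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as exactly eight binary digits.
        binary = "".join(f"{byte:08b}" for byte in raw)
        result[label] = (binary, raw.hex())
    return result


if __name__ == "__main__":
    for label, (binary, hexadecimal) in to_bit_strings("^").items():
        print(label, binary, hexadecimal)
```

An ASCII character such as "^" yields the same single byte in all five charsets, whereas a kanji such as 縡 takes three bytes in UTF-8 and collapses to a single "?" byte in ISO-8859-1.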