What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?寃?∧源???低大淨?寃?∧源???低垈^ 1001111111000100001111111001101110000011001111111000000111001000100011001011100100111111001111110011111110010010111000011001000111100101100111111100010000111111100110111000001100111111100000011100100010001100101110010011111100111111001111111001001011100001100110101011000001011110 9fc43f9b833f81c88cb93f3f3f92e191e59fc43f9b833f81c88cb93f3f3f92e19ab05e
EUC-JP 淨?寃?∧源???低大淨?寃?∧源???低垈^ 1101111011000110001111111101010111100011001111111010001011001010101110001011101100111111001111110011111111000100111000111100001011100111110111101100011000111111110101011110001100111111101000101100101010111000101110110011111100111111001111111100010011100011110101001011001001011110 dec63fd5e33fa2cab8bb3f3f3fc4e3c2e7dec63fd5e33fa2cab8bb3f3f3fc4e3d4b25e
UTF-8 淨렠寃대∧源렰罹렗低大淨렠寃대∧源렰罹렗低垈^ 11100110101101111010100011101011101000001010000011100101101011111000001111101011100011001000000011100010100010001010011111100110101110101001000011101011101000001011000011101111101001111010011011101011101000001001011111100100101111011000111011100101101001001010011111100110101101111010100011101011101000001010000011100101101011111000001111101011100011001000000011100010100010001010011111100110101110101001000011101011101000001011000011101111101001111010011011101011101000001001011111100100101111011000111011100101100111101000100001011110 e6b7a8eba0a0e5af83eb8c80e288a7e6ba90eba0b0efa7a6eba097e4bd8ee5a4a7e6b7a8eba0a0e5af83eb8c80e288a7e6ba90eba0b0efa7a6eba097e4bd8ee59e885e
UHC 淨렠寃대∧源렰罹렗低大淨렠寃대∧源렰罹렗低垈^ 111011111110010010001110101100011110101010110010101101001110101110100001111111001110101010111001100011101011110111101100101110101000111010101100111011101011100011010011110111101110111111100100100011101011000111101010101100101011010011101011101000011111110011101010101110011000111010111101111011001011101010001110101011001110111010111000110100111101110001011110 efe48eb1eab2b4eba1fceab98ebdecba8eaceeb8d3deefe48eb1eab2b4eba1fceab98ebdecba8eaceeb8d3dc5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
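
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page: the codec names below (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumed to be the closest Python equivalents, and the helper names encode_bits and convert are made up for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("大^").items():
        print(label, bits, hexstr)
```

For example, convert("大")["UTF-8"] gives ('111001011010010010100111', 'e5a4a7'), and the same character in ISO-8859-1 comes out as 3f because that charset has no code for it.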