To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?窪校?源?醍??拒淨?窪校?源?醍??居^ 1001111111000100001111111000110001000101100011010101101000111111100011001011100100111111100100011110011100111111001111111000101110010001100111111100010000111111100011000100010110001101010110100011111110001100101110010011111110010001111001110011111100111111100010111000111101011110 9fc43f8c458d5a3f8cb93f91e73f3f8b919fc43f8c458d5a3f8cb93f91e73f3f8b8f5e
EUC-JP 淨?窪校汶源?醍??拒淨?窪校汶源?醍??居^ 110111101100011000111111101101111010011010111001101110111000111111000110111001011011100010111011001111111100001011101001001111110011111110110101111100011101111011000110001111111011011110100110101110011011101110001111110001101110010110111000101110110011111111000010111010010011111100111111101101011110111101011110 dec63fb7a6b9bb8fc6e5b8bb3fc2e93f3fb5f1dec63fb7a6b9bb8fc6e5b8bb3fc2e93f3fb5ef5e
UTF-8 淨렠窪校汶源렰醍닸렮拒淨렠窪校汶源렰醍닸렮居^ 11100110101101111010100011101011101000001010000011100111101010101010101011100110101000001010000111100110101100011011011011100110101110101001000011101011101000001011000011101001100001101000110111101011100010111011100011101011101000001010111011100110100010111001001011100110101101111010100011101011101000001010000011100111101010101010101011100110101000001010000111100110101100011011011011100110101110101001000011101011101000001011000011101001100001101000110111101011100010111011100011101011101000001010111011100101101100011000010101011110 e6b7a8eba0a0e7aaaae6a0a1e6b1b6e6ba90eba0b0e9868deb8bb8eba0aee68b92e6b7a8eba0a0e7aaaae6a0a1e6b1b6e6ba90eba0b0e9868deb8bb8eba0aee5b1855e
UHC 淨렠窪校汶源렰醍닸렮拒淨렠窪校汶源렰醍닸렮居^ 111011111110010010001110101100011110100011000001110011101110100011011010101000011110101010111001100011101011110111110000101101011011010011100110100011101011101111001011110111101110111111100100100011101011000111101000110000011100111011101000110110101010000111101010101110011000111010111101111100001011010110110100111001101000111010111011110010111101110001011110 efe48eb1e8c1cee8daa1eab98ebdf0b5b4e68ebbcbdeefe48eb1e8c1cee8daa1eab98ebdf0b5b4e68ebbcbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)