To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?霽?鬱屯?霽?鬱憺淨?霽?鬱屯?霽?鬱憺^ 10011111110001000011111111101000110001110011111110011111010101001001001111010100001111111110100011000111001111111001111101010100100111001110100110011111110001000011111111101000110001110011111110011111010101001001001111010100001111111110100011000111001111111001111101010100100111001110100101011110 9fc43fe8c73f9f5493d43fe8c73f9f549ce99fc43fe8c73f9f5493d43fe8c73f9f549ce95e
EUC-JP 淨?霽?鬱屯?霽?鬱憺淨?霽?鬱屯?霽?鬱憺^ 11011110110001100011111111110000110010010011111111011101101101011100011011010110001111111111000011001001001111111101110110110101110110001110101111011110110001100011111111110000110010010011111111011101101101011100011011010110001111111111000011001001001111111101110110110101110110001110101101011110 dec63ff0c93fddb5c6d63ff0c93fddb5d8ebdec63ff0c93fddb5c6d63ff0c93fddb5d8eb5e
UTF-8 淨렠霽렢鬱屯㉢霽렢鬱憺淨렠霽렢鬱屯㉢霽렢鬱憺^ 11100110101101111010100011101011101000001010000011101001100111001011110111101011101000001010001011101001101011001011000111100101101100011010111111100011100010011010001011101001100111001011110111101011101000001010001011101001101011001011000111100110100001101011101011100110101101111010100011101011101000001010000011101001100111001011110111101011101000001010001011101001101011001011000111100101101100011010111111100011100010011010001011101001100111001011110111101011101000001010001011101001101011001011000111100110100001101011101001011110 e6b7a8eba0a0e99cbdeba0a2e9acb1e5b1afe389a2e99cbdeba0a2e9acb1e686bae6b7a8eba0a0e99cbdeba0a2e9acb1e5b1afe389a2e99cbdeba0a2e9acb1e686ba5e
UHC 淨렠霽렢鬱屯㉢霽렢鬱憺淨렠霽렢鬱屯㉢霽렢鬱憺^ 111011111110010010001110101100011111000010111000100011101011001111101010101001101101010011101010101010001011001111110000101110001000111010110011111010101010011011010011101111001110111111100100100011101011000111110000101110001000111010110011111010101010011011010100111010101010100010110011111100001011100010001110101100111110101010100110110100111011110001011110 efe48eb1f0b88eb3eaa6d4eaa8b3f0b88eb3eaa6d3bcefe48eb1f0b88eb3eaa6d4eaa8b3f0b88eb3eaa6d3bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)