To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 綜???堯逢??熊D綜???堯逢??熊D^ 1001000110001110001111110011111100111111111010101001111110001000101001110011111100111111100011000100011001000100100100011000111000111111001111110011111111101010100111111000100010100111001111110011111110001100010001100100010001011110 918e3f3f3fea9f88a73f3f8c4644918e3f3f3fea9f88a73f3f8c46445e
EUC-JP 綜???堯逢??熊D綜???堯逢??熊D^ 1100000111101110001111110011111100111111111101001010000110110000101010010011111100111111101101111010011101000100110000011110111000111111001111110011111111110100101000011011000010101001001111110011111110110111101001110100010001011110 c1ee3f3f3ff4a1b0a93f3fb7a744c1ee3f3f3ff4a1b0a93f3fb7a7445e
UTF-8 綜븀렪렧堯逢렱쮜熊D綜븀렪렧堯逢렱쮜熊D^ 111001111011011010011100111010111011100010000000111010111010000010101010111010111010000010100111111001011010000010101111111010011000000010100010111010111010000010110001111011001010111010011100111001111000011010001010010001001110011110110110100111001110101110111000100000001110101110100000101010101110101110100000101001111110010110100000101011111110100110000000101000101110101110100000101100011110110010101110100111001110011110000110100010100100010001011110 e7b69cebb880eba0aaeba0a7e5a0afe980a2eba0b1ecae9ce7868a44e7b69cebb880eba0aaeba0a7e5a0afe980a2eba0b1ecae9ce7868a445e
UHC 綜븀렪렧堯逢렱쮜熊D綜븀렪렧堯逢렱쮜熊D^ 111100001111110010111010111001111000111010111000100011101011011011101000111010111101110011110001100011101011111011000010111010001110101010101000010001001111000011111100101110101110011110001110101110001000111010110110111010001110101111011100111100011000111010111110110000101110100011101010101010000100010001011110 f0fcbae78eb88eb6e8ebdcf18ebec2e8eaa844f0fcbae78eb88eb6e8ebdcf18ebec2e8eaa8445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)