Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 繒貊?郁?濟?烝?繒貊?郁?濟?再?^ 1111101110001111111001101011101100111111100010001110100000111111111000000101101000111111111000000111111000111111111110111000111111100110101110110011111110001000111010000011111111100000010110100011111110001101110001000011111101011110 fb8fe6bb3f88e83fe05a3fe07e3ffb8fe6bb3f88e83fe05a3f8dc43f5e
EUC-JP 繒貊?郁?濟?烝?繒貊?郁?濟?再?^ 10001111110101001101010011101100101111010011111110110000111010100011111111011111101110110011111111011111110111110011111110001111110101001101010011101100101111010011111110110000111010100011111111011111101110110011111110111010110001100011111101011110 8fd4d4ecbd3fb0ea3fdfbb3fdfdf3f8fd4d4ecbd3fb0ea3fdfbb3fbac63f5e
UTF-8 繒貊웡郁렠濟렩烝렎繒貊웡郁렠濟렩再렲^ 11100111101110011001001011101000101100101000101011101100100110111010000111101001100000111000000111101011101000001010000011100110101111111001111111101011101000001010100111100111100000111001110111101011101000001000111011100111101110011001001011101000101100101000101011101100100110111010000111101001100000111000000111101011101000001010000011100110101111111001111111101011101000001010100111100101100001101000110111101011101000001011001001011110 e7b992e8b28aec9ba1e98381eba0a0e6bf9feba0a9e7839deba08ee7b992e8b28aec9ba1e98381eba0a0e6bf9feba0a9e5868deba0b25e
UHC 繒貊웡郁렠濟렩烝렎繒貊웡郁렠濟렩再렲^ 11110001111110011101100011100111101111111111110111101001111101001000111010110001111100001010110110001110101101111111000111110110100011101010010011110001111110011101100011100111101111111111110111101001111101001000111010110001111100001010110110001110101101111110111010100010100011101011111101011110 f1f9d8e7bffde9f48eb1f0ad8eb7f1f68ea4f1f9d8e7bffde9f48eb1f0ad8eb7eea28ebf5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
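
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the function name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is one plausible explanation for the `3f` bytes in the table rows above. Python's codecs may not match the page's tables for every vendor-specific character.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# These pairings are assumptions; the page's own converter may differ in detail.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("あ^").items():
        print(label, bits, hexed)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column, as in the table above.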