What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 證??鬱?諸?誼潘濾煜?鬱?諸?毅?^ 1110011010011010001111110011111110011111010101000011111110001111100101000011111110001011011000101110000001001110111000000110100011111011010101010011111110011111010101000011111110001111100101000011111110001011010000100011111101011110 e69a3f3f9f543f8f943f8b62e04ee068fb553f9f543f8f943f8b423f5e
EUC-JP 證??鬱?諸?誼潘濾煜?鬱?諸?毅?^ 111010111111101000111111001111111101110110110101001111111011110111110100001111111011010111000011110111111010111111011111110010011000111111001001111111000011111111011101101101010011111110111101111101000011111110110101101000110011111101011110 ebfa3f3fddb53fbdf43fb5c3dfafdfc98fc9fc3fddb53fbdf43fb5a33f5e
UTF-8 證띄캑鬱렋諸렪誼潘濾煜렡鬱렋諸렪毅렰^ 11101000101011011000100111101011100111011000010011101100101110101001000111101001101011001011000111101011101000001000101111101000101010111011100011101011101000001010101011101000101010101011110011100110101111011001100011100110101111111011111011100111100001011001110011101011101000001010000111101001101011001011000111101011101000001000101111101000101010111011100011101011101000001010101011100110101011111000010111101011101000001011000001011110 e8ad89eb9d84ecba91e9acb1eba08be8abb8eba0aae8aabce6bd98e6bfbee7859ceba0a1e9acb1eba08be8abb8eba0aae6af85eba0b05e
UHC 證띄캑鬱렋諸렪誼潘濾煜렡鬱렋諸렪毅렰^ 11110001111110111011011011100111110001001011010011101010101001101000111010100010111100001011001110001110101110001110101111111110110110101110101111010101111010111110100111110010100011101011001011101010101001101000111010100010111100001011001110001110101110001110101111110110100011101011110101011110 f1fbb6e7c4b4eaa68ea2f0b38eb8ebfedaebd5ebe9f28eb2eaa68ea2f0b38eb8ebf68ebd5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
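
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name encode_all and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the rows above. Python's codecs may differ from this page's converter for some characters, so the output is not guaranteed to match byte for byte.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("A^").items():
        print(label, bits, hex_string)
```

ASCII characters such as "A" and "^" encode to the same single byte in all five charsets, so differences only appear for non-ASCII input.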