To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?瓏??沚基??沚亘??張?障?諸?霆 00111111111000001111101000111111001111111001111110001101100010101110111000111111001111111001111110001101100110000110101000111111001111111001001010100011001111111000111111100001001111111000111110010100001111111110100010111011 3fe0fa3f3f9f8d8aee3f3f9f8d986a3f3f92a33f8fe13f8f943fe8bb
EUC-JP ?瓏??沚基??沚亘??張?障?諸?霆 00111111111000001111110000111111001111111101110111101101101101001111000000111111001111111101110111101101110011111100101100111111001111111100010010100101001111111011111011100011001111111011110111110100001111111111000010111101 3fe0fc3f3fddedb4f03f3fddedcfcb3f3fc4a53fbee33fbdf43ff0bd
UTF-8 亐瓏렚렞沚基렰렖沚亘렭렏張렋障렜諸렪霆 111001001011101010010000111001111001001110001111111010111010000010011010111010111010000010011110111001101011001010011010111001011001111110111010111010111010000010110000111010111010000010010110111001101011001010011010111001001011101010011000111010111010000010101101111010111010000010001111111001011011110010110101111010111010000010001011111010011001101010011100111010111010000010011100111010001010101110111000111010111010000010101010111010011001110010000110 e4ba90e7938feba09aeba09ee6b29ae59fbaeba0b0eba096e6b29ae4ba98eba0adeba08fe5bcb5eba08be99a9ceba09ce8abb8eba0aae99c86
UHC 亐瓏렚렞沚基렰렖沚亘렭렏張렋障렜諸렪霆 1110101010100111110101101110101010001110101011011000111010101111111100101010111111010000111100011000111010111101100011101010101111110010101011111101000011100110100011101011101010001110101001011110110111100101100011101010001011101110101000011000111010101110111100001011001110001110101110001110111111111101 eaa7d6ea8ead8eaff2afd0f18ebd8eabf2afd0e68eba8ea5ede58ea2eea18eaef0b38eb8effd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)