To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡?轅?垣矜??啼?仲?縡?轅?垣矜??啼?仲?^ 11100011011100010011111111100111011101100011111110001010010111111110000111100000001111110011111110011010011001010011111110010010100001110011111111100011011100010011111111100111011101100011111110001010010111111110000111100000001111110011111110011010011001010011111110010010100001110011111101011110 e3713fe7763f8a5fe1e03f3f9a653f92873fe3713fe7763f8a5fe1e03f3f9a653f92873f5e
EUC-JP 縡?轅?垣矜??啼?仲?縡?轅?垣矜??啼?仲?^ 11100101110100100011111111101101110101110011111110110011110000001110001011100010001111110011111111010011110001100011111111000011111001110011111111100101110100100011111111101101110101110011111110110011110000001110001011100010001111110011111111010011110001100011111111000011111001110011111101011110 e5d23fedd73fb3c0e2e23f3fd3c63fc3e73fe5d23fedd73fb3c0e2e23f3fd3c63fc3e73f5e
UTF-8 縡렕轅렋垣矜렣렔啼렮仲줏縡렕轅렋垣矜렣렔啼렮仲줌^ 11100111101110001010000111101011101000001001010111101000101111011000010111101011101000001000101111100101100111101010001111100111100111111001110011101011101000001010001111101011101000001001010011100101100101011011110011101011101000001010111011100100101110111011001011101100101001001000111111100111101110001010000111101011101000001001010111101000101111011000010111101011101000001000101111100101100111101010001111100111100111111001110011101011101000001010001111101011101000001001010011100101100101011011110011101011101000001010111011100100101110111011001011101100101001001000110001011110 e7b8a1eba095e8bd85eba08be59ea3e79f9ceba0a3eba094e595bceba0aee4bbb2eca48fe7b8a1eba095e8bd85eba08be59ea3e79f9ceba0a3eba094e595bceba0aee4bbb2eca48c5e
UHC 縡렕轅렋垣矜렣렔啼렮仲줏縡렕轅렋垣矜렣렔啼렮仲줌^ 11101110101011011000111010101010111010101011111110001110101000101110101010101111110100001110100010001110101101001000111010101001111100001010011010001110101110111111000111101010110000011101111011101110101011011000111010101010111010101011111110001110101000101110101010101111110100001110100010001110101101001000111010101001111100001010011010001110101110111111000111101010110000011101110001011110 eead8eaaeabf8ea2eaafd0e88eb48ea9f0a68ebbf1eac1deeead8eaaeabf8ea2eaafd0e88eb48ea9f0a68ebbf1eac1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)