To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 止??兢侶靖?藕?止??兢侶靖?藕?^ 1000111001111110001111110011111110011001010111011001011110110101100101101111010100111111111001010101100000111111100011100111111000111111001111111001100101011101100101111011010110010110111101010011111111100101010110000011111101011110 8e7e3f3f995d97b596f53fe5583f8e7e3f3f995d97b596f53fe5583f5e
EUC-JP 止??兢侶靖?藕?止??兢侶靖?藕?^ 1011101111011111001111110011111111010001101111101100111010110111110011001111011100111111111010011011100100111111101110111101111100111111001111111101000110111110110011101011011111001100111101110011111111101001101110010011111101011110 bbdf3f3fd1beceb7ccf73fe9b93fbbdf3f3fd1beceb7ccf73fe9b93f5e
UTF-8 止얗렒兢侶靖렢藕렕止얗렒兢侶靖렢藕렕^ 11100110101011011010001011101100100101101001011111101011101000001001001011100101100001011010001011100100101111101011011011101001100111011001011011101011101000001010001011101000100101111001010111101011101000001001010111100110101011011010001011101100100101101001011111101011101000001001001011100101100001011010001011100100101111101011011011101001100111011001011011101011101000001010001011101000100101111001010111101011101000001001010101011110 e6ada2ec9697eba092e585a2e4beb6e99d96eba0a2e89795eba095e6ada2ec9697eba092e585a2e4beb6e99d96eba0a2e89795eba0955e
UHC 止얗렒兢侶靖렢藕렕止얗렒兢侶靖렢藕렕^ 11110010101011011011111011101001100011101010011111010000111001111101010111100010111011111111111010001110101100111110100111100100100011101010101011110010101011011011111011101001100011101010011111010000111001111101010111100010111011111111111010001110101100111110100111100100100011101010101001011110 f2adbee98ea7d0e7d5e2effe8eb3e9e48eaaf2adbee98ea7d0e7d5e2effe8eb3e9e48eaa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)