To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?????里??????????里?????^ 00111111001111110011111100111111001111111001011110100010001111110011111100111111001111110011111100111111001111110011111100111111001111111001011110100010001111110011111100111111001111110011111101011110 3f3f3f3f3f97a23f3f3f3f3f3f3f3f3f3f97a23f3f3f3f3f5e
EUC-JP ?????里??????????里?????^ 00111111001111110011111100111111001111111100111010100100001111110011111100111111001111110011111100111111001111110011111100111111001111111100111010100100001111110011111100111111001111110011111101011110 3f3f3f3f3fcea43f3f3f3f3f3f3f3f3f3fcea43f3f3f3f3f5e
UTF-8 앍렰섶솬뤓里슨생쓱롐롗앍렰섶솬뤓里슨생쓱롐롕^ 11101100100101011000110111101011101000001011000011101100100001001011011011101100100001101010110011101011101001001001001111101001100001111000110011101100100010101010100011101100100000111001110111101100100100111011000111101011101000011001000011101011101000011001011111101100100101011000110111101011101000001011000011101100100001001011011011101100100001101010110011101011101001001001001111101001100001111000110011101100100010101010100011101100100000111001110111101100100100111011000111101011101000011001000011101011101000011001010101011110 ec958deba0b0ec84b6ec86aceba493e9878cec8aa8ec839dec93b1eba190eba197ec958deba0b0ec84b6ec86aceba493e9878cec8aa8ec839dec93b1eba190eba1955e
UHC 앍렰섶솬뤓里슨생쓱롐롗앍렰섶솬뤓里슨생쓱롐롕^ 101111101100110010001110101111011011110010111011101111001101111110001111110000111101011111101100101111011011110010111011111111011011111010110011100011101101011010001110110110111011111011001100100011101011110110111100101110111011110011011111100011111100001111010111111011001011110110111100101110111111110110111110101100111000111011010110100011101101100101011110 becc8ebdbcbbbcdf8fc3d7ecbdbcbbfdbeb38ed68edbbecc8ebdbcbbbcdf8fc3d7ecbdbcbbfdbeb38ed68ed95e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
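
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that the page's charset labels correspond to Python's codec names (SJIS-WIN to cp932, UHC to cp949, and so on) and that characters missing from a charset are replaced by "?" (0x3f), as the table rows suggest. The names CHARSETS, encode_row and encode_table are made up for this illustration.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    # Decode again to see which characters survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Build one row per charset, keyed by the page's charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in encode_table("里^").items():
        print(label, shown, bits, hexstr)
```

For the input "里^", this sketch yields the byte sequences 97a2 5e for cp932, cea4 5e for euc_jp and e9878c 5e for utf-8, which agree with the bytes shown for those two characters in the table. ISO-8859-1 cannot represent 里, so it yields 3f 5e.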