To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?制耿??縡?寃??狡??畯脈??梯?趙脈? 11100011011100010011111110010000101001111110001111010100001111110011111111100011011100010011111110011011100000110011111100111111111000001100001000111111001111111111101101101111100101101010110000111111001111111001001011110010001111111110011011100010100101101010110000111111 e3713f90a7e3d43f3fe3713f9b833f3fe0c23f3ffb6f96ac3f3f92f23fe6e296ac3f
EUC-JP 縡?制耿??縡?寃??狡??畯脈??梯?趙脈? 1110010111010010001111111100000010101001111001101101011000111111001111111110010111010010001111111101010111100011001111110011111111100000110001000011111100111111100011111100110110111011110011001010111000111111001111111100010011110100001111111110110011100100110011001010111000111111 e5d23fc0a9e6d63f3fe5d23fd5e33f3fe0c43f3f8fcdbbccae3f3fc4f43fece4ccae3f
UTF-8 縡렕制耿렧렢縡렕寃닸렲狡렕렟畯脈롛렣梯렟趙脈렲 111001111011100010100001111010111010000010010101111001011000100010110110111010001000000010111111111010111010000010100111111010111010000010100010111001111011100010100001111010111010000010010101111001011010111110000011111010111000101110111000111010111010000010110010111001111000101110100001111010111010000010010101111010111010000010011111111001111001010110101111111010001000010010001000111010111010000110011011111010111010000010100011111001101010001010101111111010111010000010011111111010001011011010011001111010001000010010001000111010111010000010110010 e7b8a1eba095e588b6e880bfeba0a7eba0a2e7b8a1eba095e5af83eb8bb8eba0b2e78ba1eba095eba09fe795afe88488eba19beba0a3e6a2afeba09fe8b699e88488eba0b2
UHC 縡렕制耿렧렢縡렕寃닸렲狡렕렟畯脈롛렣梯렟趙脈렲 11101110101011011000111010101010111100001010010011001100111010101000111010110110100011101011001111101110101011011000111010101010111010101011001010110100111001101000111010111111110011101110101010001110101010101000111010110000111100011110000111011000111001101000111011011111100011101011010011110000101011001000111010110000111100001110000111011000111001101000111010111111 eead8eaaf0a4ccea8eb68eb3eead8eaaeab2b4e68ebfceea8eaa8eb0f1e1d8e68edf8eb4f0ac8eb0f0e1d8e68ebf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)