To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 凹??詣??鹽??節??椰?????絶??^ 10001001100110100011111100111111100011000111011100111111001111111110101001100100001111110011111110010000110111110011111100111111100111101011110100111111001111110011111100111111001111111001000011100010001111110011111101011110 899a3f3f8c773f3fea643f3f90df3f3f9ebd3f3f3f3f3f90e23f3f5e
EUC-JP 凹??詣??鹽??節??椰?????絶??^ 10110001111110100011111100111111101101111101100000111111001111111111001111000101001111110011111111000000111000010011111100111111110111001011111100111111001111110011111100111111001111111100000011100100001111110011111101011110 b1fa3f3fb7d83f3ff3c53f3fc0e13f3fdcbf3f3f3f3f3fc0e43f3f5e
UTF-8 凹앭쳣詣앯뿏鹽억풃節쇤뙘椰됭깮廉붺ㅎ絶뗰슨^ 11100101100001111011100111101100100101011010110111101100101100111010001111101000101010011010001111101100100101011010111111101011101111111000111111101001101110011011110111101100100101101011010111101101100100101000001111100111101011111000000011101100100001111010010011101011100110011001100011100110101001001011000011101011100100001010110111101010101110011010111011101111101001101010001011101011101101101011101011100011100001011000111011100111101101011011011011101011100101111011000011101100100010101010100001011110 e587b9ec95adecb3a3e8a9a3ec95afebbf8fe9b9bdec96b5ed9283e7af80ec87a4eb9998e6a4b0eb90adeab9aeefa6a2ebb6bae3858ee7b5b6eb97b0ec8aa85e
UHC 凹앭쳣詣앯뿏鹽억풃節쇤뙘椰됭깮廉붺ㅎ絶뗰슨^ 11101000111010101001110111100101101010111000100111100111111000011001110111100111100101111001010011100111101001001011111011101111101111101000101111101111101111011011110011101001100011001001110111100101101010111000100111101000100000111001110111100110111101011001010011100111101001001011111011101111101111101000101111101111101111011011110001011110 e8ea9de5ab89e7e19de79794e7a4beefbe8befbdbce98c9de5ab89e8839de6f594e7a4beefbe8befbdbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)