What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 凹??詣??塢??謠??椰?????絶?? 100010011001101000111111001111111000110001110111001111110011111110011010110001110011111100111111111001101000111100111111001111111001111010111101001111110011111100111111001111110011111110010000111000100011111100111111 899a3f3f8c773f3f9ac73f3fe68f3f3f9ebd3f3f3f3f3f90e23f3f
EUC-JP 凹??詣??塢??謠??椰?????絶?? 101100011111101000111111001111111011011111011000001111110011111111010100110010010011111100111111111010111110111100111111001111111101110010111111001111110011111100111111001111110011111111000000111001000011111100111111 b1fa3f3fb7d83f3fd4c93f3febef3f3fdcbf3f3f3f3f3fc0e43f3f
UTF-8 凹앭쳣詣앯뿏塢딉풆謠쇽슴椰됬퀌廉붻짅絶랃숲 111001011000011110111001111011001001010110101101111011001011001110100011111010001010100110100011111011001001010110101111111010111011111110001111111001011010000110100010111010111001010010001001111011011001001010000110111010001010110010100000111011001000011110111101111011001000101010110100111001101010010010110000111010111001000010101100111011011000000010001100111011111010011010100010111010111011011010111011111011001010011110000101111001111011010110110110111010111001111010000011111011001000100010110010 e587b9ec95adecb3a3e8a9a3ec95afebbf8fe5a1a2eb9489ed9286e8aca0ec87bdec8ab4e6a4b0eb90aced808cefa6a2ebb6bbeca785e7b5b6eb9e83ec88b2
UHC 凹앭쳣詣앯뿏塢딉풆謠쇽슴椰됬퀌廉붻짅絶랃숲 111010001110101010011101111001011010101110001001111001111110000110011101111001111001011110010100111001111111000110001010111011111011111010001110111010011010101010111100111011111011110110111111111001011010101110001001111001111011001110000010111001101111010110010100111010001010001110010100111011111011111010001101111011111011110110100011 e8ea9de5ab89e7e19de79794e7f18aefbe8ee9aabcefbdbfe5ab89e7b382e6f594e8a394efbe8defbda3

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
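
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode `text` in every charset and collect the resulting bit strings."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, character(s), binary, hexadecimal.
    for label, (binary, hexadecimal) in convert("凹").items():
        print(label, "凹", binary, hexadecimal)
```

For the single character 凹, the UTF-8 result is `e587b9`, which matches the first three bytes of the UTF-8 row in the table.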