What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癰????????熱??鍮?????壹①?^ 111000011001111000111111001111110011111100111111001111110011111100111111001111111001010001001101001111110011111111101000010010100011111100111111001111110011111100111111100110101110001110000111010000000011111101011110 e19e3f3f3f3f3f3f3f3f944d3f3fe84a3f3f3f3f3f9ae387403f5e
EUC-JP 癰????????熱??鍮?????壹??^ 1110000111111110001111110011111100111111001111110011111100111111001111110011111111000111101011100011111100111111111011111010101100111111001111110011111100111111001111111101010011100101001111110011111101011110 e1fe3f3f3f3f3f3f3f3fc7ae3f3fefab3f3f3f3f3fd4e53f3f5e
UTF-8 癰꾨퉬溜곕젨麗덊렓熱㎬퉬鍮꾨젾溜졿컝壹①탳^ 11100111100110011011000011101010101111101010100011101101100010011010110011101111101001111000101111101010101100111001010111101100101000001010100011101111101001101000100011101011100011011000101011101011101000001001001111100111100001101011000111100011100011101010110011101101100010011010110011101001100011011010111011101010101111101010100011101100101000001011111011101111101001111000101111101100101000011011111111101100101110111001110111100101101000111011100111100010100100011010000011101101100000111011001101011110 e799b0eabea8ed89acefa78beab395eca0a8efa688eb8d8aeba093e786b1e38eaced89ace98daeeabea8eca0beefa78beca1bfecbb9de5a3b9e291a0ed83b35e
UHC 癰꾨퉬溜곕젨麗덊렓熱㎬퉬鍮꾨젾溜졿컝壹①탳^ 11101000101110011000010011101011101110011000010011101010111111101011000011101011101000001010000011100110101100001000100011101101100011101010100011100110111100001010011111101000101110011000010011101011101110011000010011101011101000001011000011101010111111101010000011100110101100001000100011101100111011001010100011100111101101011001000001011110 e8b984ebb984eafeb0eba0a0e6b088ed8ea8e6f0a7e8b984ebb984eba0b0eafea0e6b088ececa8e7b5905e
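A table like the one above can be reproduced offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the function name `encode_row`, the `CHARSETS` mapping, and the choice of Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is why the ISO-8859-1 row consists almost entirely of `3f` bytes.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
# The codec choices are assumptions; the page does not say what it uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become '?' instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte is written as 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "熱^"  # one CJK character plus one ASCII character
    print("Charset", "Character", "Bit string (binary)", "Bit string (hexadecimal)")
    for label, codec in CHARSETS.items():
        shown, bits, hex_string = encode_row(sample, codec)
        print(label, shown, bits, hex_string)
```

For the sample `熱^`, the SJIS-WIN, EUC-JP, and UTF-8 rows end in the same byte sequences that appear for `熱` and `^` in the table above (`944d`, `c7ae`, `e786b1`, and `5e`).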

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).