To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癰?????橈??熱??鍮?????壹①?^ 11100001100111100011111100111111001111110011111100111111100111101111010000111111001111111001010001001101001111110011111111101000010010100011111100111111001111110011111100111111100110101110001110000111010000000011111101011110 e19e3f3f3f3f3f9ef43f3f944d3f3fe84a3f3f3f3f3f9ae387403f5e
EUC-JP 癰?????橈??熱??鍮?????壹??^ 111000011111111000111111001111110011111100111111001111111101110011110110001111110011111111000111101011100011111100111111111011111010101100111111001111110011111100111111001111111101010011100101001111110011111101011110 e1fe3f3f3f3f3fdcf63f3fc7ae3f3fefab3f3f3f3f3fd4e53f3f5e
UTF-8 癰꾨퉬溜곕젨橈놃렓熱㎬퉬鍮꾨젾溜좈쐠壹①탳^ 11100111100110011011000011101010101111101010100011101101100010011010110011101111101001111000101111101010101100111001010111101100101000001010100011100110101010011000100011101011100001101000001111101011101000001001001111100111100001101011000111100011100011101010110011101101100010011010110011101001100011011010111011101010101111101010100011101100101000001011111011101111101001111000101111101100101000101000100011101100100100001010000011100101101000111011100111100010100100011010000011101101100000111011001101011110 e799b0eabea8ed89acefa78beab395eca0a8e6a988eb8683eba093e786b1e38eaced89ace98daeeabea8eca0beefa78beca288ec90a0e5a3b9e291a0ed83b35e
UHC 癰꾨퉬溜곕젨橈놃렓熱㎬퉬鍮꾨젾溜좈쐠壹①탳^ 11101000101110011000010011101011101110011000010011101010111111101011000011101011101000001010000011101000111110101000011011101101100011101010100011100110111100001010011111101000101110011000010011101011101110011000010011101011101000001011000011101010111111101010000011101001100111001000011011101100111011001010100011100111101101011001000001011110 e8b984ebb984eafeb0eba0a0e8fa86ed8ea8e6f0a7e8b984ebb984eba0b0eafea0e99c86ececa8e7b5905e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)