To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淫??????長?珥◇淫??????長?珥●^ 10001000111110100011111100111111001111110011111100111111001111111001001010110111001111111110000011100000100000011001111010001000111110100011111100111111001111110011111100111111001111111001001010110111001111111110000011100000100000011001110001011110 88fa3f3f3f3f3f3f92b73fe0e0819e88fa3f3f3f3f3f3f92b73fe0e0819c5e
EUC-JP 淫?珽????長?珥◇淫?珽????長?珥●^ 1011000011111100001111111000111111001011111111100011111100111111001111110011111111000100101110010011111111100000111000101010000111111110101100001111110000111111100011111100101111111110001111110011111100111111001111111100010010111001001111111110000011100010101000011111110001011110 b0fc3f8fcbfe3f3f3f3fc4b93fe0e2a1feb0fc3f8fcbfe3f3f3f3fc4b93fe0e2a1fc5e
UTF-8 淫렪珽블렎흥김長렚珥◇淫렪珽블렎흥김長렚珥●^ 11100110101101111010101111101011101000001010101011100111100011111011110111101011101110001001010011101011101000001000111011101101100111011010010111101010101110011000000011101001100101011011011111101011101000001001101011100111100011111010010111100010100101111000011111100110101101111010101111101011101000001010101011100111100011111011110111101011101110001001010011101011101000001000111011101101100111011010010111101010101110011000000011101001100101011011011111101011101000001001101011100111100011111010010111100010100101111000111101011110 e6b7abeba0aae78fbdebb894eba08eed9da5eab980e995b7eba09ae78fa5e29787e6b7abeba0aae78fbdebb894eba08eed9da5eab980e995b7eba09ae78fa5e2978f5e
UHC 淫렪珽블렎흥김長렚珥◇淫렪珽블렎흥김長렚珥●^ 111010111110001010001110101110001110111111101010101110101110110110001110101001001100100011101111101100011110100011101101111111101000111010101101111011001011010010100001110111101110101111100010100011101011100011101111111010101011101011101101100011101010010011001000111011111011000111101000111011011111111010001110101011011110110010110100101000011101110001011110 ebe28eb8efeabaed8ea4c8efb1e8edfe8eadecb4a1deebe28eb8efeabaed8ea4c8efb1e8edfe8eadecb4a1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)