What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??砥?雋麥?爰??‥??砥?雋麥?爰??‥^ 001111110011111110010011011101010011111111101000101100101110101001101101001111111110000010100111001111110011111110000001011001000011111100111111100100110111010100111111111010001011001011101010011011010011111111100000101001110011111100111111100000010110010001011110 3f3f93753fe8b2ea6d3fe0a73f3f81643f3f93753fe8b2ea6d3fe0a73f3f81645e
EUC-JP 珽?砥?雋麥?爰?勖‥珽?砥?雋麥?爰?勖‥^ 1000111111001011111111100011111111000101110101100011111111110000101101001111001111001110001111111110000010101001001111111000111110110011111011011010000111000101100011111100101111111110001111111100010111010110001111111111000010110100111100111100111000111111111000001010100100111111100011111011001111101101101000011100010101011110 8fcbfe3fc5d63ff0b4f3ce3fe0a93f8fb3eda1c58fcbfe3fc5d63ff0b4f3ce3fe0a93f8fb3eda1c55e
UTF-8 珽렖砥렫雋麥룬爰렪勖‥珽렖砥렫雋麥룬爰렪勖‥^ 11100111100011111011110111101011101000001001011011100111101000001010010111101011101000001010101111101001100110111000101111101001101110101010010111101011101000111010110011100111100010001011000011101011101000001010101011100101100010111001011011100010100000001010010111100111100011111011110111101011101000001001011011100111101000001010010111101011101000001010101111101001100110111000101111101001101110101010010111101011101000111010110011100111100010001011000011101011101000001010101011100101100010111001011011100010100000001010010101011110 e78fbdeba096e7a0a5eba0abe99b8be9baa5eba3ace788b0eba0aae58b96e280a5e78fbdeba096e7a0a5eba0abe99b8be9baa5eba3ace788b0eba0aae58b96e280a55e
UHC 珽렖砥렫雋麥룬爰렪勖‥珽렖砥렫雋麥룬爰렪勖‥^ 111011111110101010001110101010111111001010110010100011101011100111110001111001101101100011101010101101111110100111101010101110101000111010111000111010011110110110100001101001011110111111101010100011101010101111110010101100101000111010111001111100011110011011011000111010101011011111101001111010101011101010001110101110001110100111101101101000011010010101011110 efea8eabf2b28eb9f1e6d8eab7e9eaba8eb8e9eda1a5efea8eabf2b28eb9f1e6d8eab7e9eaba8eb8e9eda1a55e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
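
The conversion shown in the table can be approximated with a minimal Python sketch. The codec names below are assumptions that map this page's charset labels onto Python's standard codecs (for example, SJIS-WIN to `cp932` and UHC to `cp949`), and the helper names `encode_row` and `convert` are hypothetical. Replacing unencodable characters with "?" is inferred from the 3f bytes in the table; it is not confirmed to be how this tool works internally.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent (assumption based on the 3f bytes in the table).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ^").items():
        print(label, bits, hex_string)
```

Calling `convert` with a short string yields one (binary, hexadecimal) pair per charset, in the same column order as the table.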