What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 適?砥?阻麥?爰??‥適?砥?阻麥?爰??‥^ 1001001101001011001111111001001101110101001111111001000101101010111010100110110100111111111000001010011100111111001111111000000101100100100100110100101100111111100100110111010100111111100100010110101011101010011011010011111111100000101001110011111100111111100000010110010001011110 934b3f93753f916aea6d3fe0a73f3f8164934b3f93753f916aea6d3fe0a73f3f81645e
EUC-JP 適?砥?阻麥?爰?勖‥適?砥?阻麥?爰?勖‥^ 110001011010110000111111110001011101011000111111110000011100101111110011110011100011111111100000101010010011111110001111101100111110110110100001110001011100010110101100001111111100010111010110001111111100000111001011111100111100111000111111111000001010100100111111100011111011001111101101101000011100010101011110 c5ac3fc5d63fc1cbf3ce3fe0a93f8fb3eda1c5c5ac3fc5d63fc1cbf3ce3fe0a93f8fb3eda1c55e
UTF-8 適렖砥렫阻麥룬爰렪勖‥適렖砥렫阻麥룬爰렪勖‥^ 11101001100000011010100111101011101000001001011011100111101000001010010111101011101000001010101111101001100110001011101111101001101110101010010111101011101000111010110011100111100010001011000011101011101000001010101011100101100010111001011011100010100000001010010111101001100000011010100111101011101000001001011011100111101000001010010111101011101000001010101111101001100110001011101111101001101110101010010111101011101000111010110011100111100010001011000011101011101000001010101011100101100010111001011011100010100000001010010101011110 e981a9eba096e7a0a5eba0abe998bbe9baa5eba3ace788b0eba0aae58b96e280a5e981a9eba096e7a0a5eba0abe998bbe9baa5eba3ace788b0eba0aae58b96e280a55e
UHC 適렖砥렫阻麥룬爰렪勖‥適렖砥렫阻麥룬爰렪勖‥^ 111011101110101010001110101010111111001010110010100011101011100111110000111001101101100011101010101101111110100111101010101110101000111010111000111010011110110110100001101001011110111011101010100011101010101111110010101100101000111010111001111100001110011011011000111010101011011111101001111010101011101010001110101110001110100111101101101000011010010101011110 eeea8eabf2b28eb9f0e6d8eab7e9eaba8eb8e9eda1a5eeea8eabf2b28eb9f0e6d8eab7e9eaba8eb8e9eda1a55e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
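
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents from Python's standard library, and the helper name `encode_row` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is how the ISO-8859-1 row above ends up as a run of `3f` bytes.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "適^"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

Edge cases may differ between implementations (for example, which vendor extensions a codec accepts), so results for unusual characters will not necessarily match the table byte for byte.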