Google is ignoring my robots.txt

@Harper822

Posted in: #Google #GoogleSearch #RobotsTxt #Seo

Here is the content of my robots.txt file:

User-agent: *
Disallow: /images/
Disallow: /upload/
Disallow: /admin/


As you can see, I explicitly disallowed all robots from indexing the images, upload and admin folders. The problem is that one of my clients sent a request to remove content from the images folder, because a .pdf document from that folder appeared in the Google search results. Can anyone explain what I'm doing wrong here, and why Google indexed my folders?

Thx!


3 Comments


@Annie201

No Allow rule?
Try adding one:

User-agent: *
Allow: /
Disallow: /images/
Disallow: /upload/
Disallow: /admin/


@Angie530

Google's help documentation explains: "While Google won't crawl or index the content of pages blocked by robots.txt, we may still index the URLs if we find them on other pages on the web. As a result, the URL of the page and, potentially, other publicly available information such as anchor text in links to the site can appear in Google search results."

User-agent: *
Disallow: /images/

According to the above rule, all robots are denied access to the /images/ folder of a site. Because Disallow rules match by path prefix, this already covers every file in the folder. For crawlers that support wildcards, such as Googlebot, the same rule can also be written explicitly as:

Disallow: /images/*


@Sarah324

Google isn't ignoring your robots.txt. A robots.txt file does not tell Google to keep content out of its search results; it only tells Google not to crawl that content. Googlebot will therefore not fetch these files itself, but if it discovers their URLs some other way, for example through links on other pages, it can still list them in the search results.
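
To make the crawl-versus-index distinction concrete, here is a minimal Python sketch that feeds the robots.txt from the question to the standard library's urllib.robotparser. The example.com URLs and file names are made up for illustration. The parser only answers "may this user agent fetch this URL?"; nothing in the result says whether the URL may be listed in search results.

```python
from urllib.robotparser import RobotFileParser

# The rules from the question, unchanged.
ROBOTS_TXT = """\
User-agent: *
Disallow: /images/
Disallow: /upload/
Disallow: /admin/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical URLs: one inside a disallowed folder, one outside.
blocked = parser.can_fetch("Googlebot", "https://example.com/images/report.pdf")
allowed = parser.can_fetch("Googlebot", "https://example.com/index.html")

print("may crawl /images/report.pdf:", blocked)
print("may crawl /index.html:", allowed)
```

The first check comes back negative and the second positive, which is all robots.txt can express. A link to /images/report.pdf on another site can still make the URL known to a search engine.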

To keep content out of the search results you need to send the X-Robots-Tag HTTP header, which tells search engines not to index that content. If you're using Apache with mod_headers enabled, you can place a file called .htaccess in each of the mentioned directories with the following line in it:

Header set X-Robots-Tag "noindex"


That will keep the content of those directories out of the search results. Note that Googlebot has to be able to fetch a file to see this header, so those directories must not also be blocked in robots.txt.
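
Once the header is configured, it is worth checking that responses really carry it. The following Python sketch is one way to inspect a set of response headers. The function name has_noindex is hypothetical, and the check is deliberately simplified: it ignores the optional user-agent prefix form (such as "googlebot: noindex") and treats "none" as implying noindex.

```python
from typing import Mapping


def has_noindex(headers: Mapping[str, str]) -> bool:
    """Return True if an X-Robots-Tag header asks search engines not to index.

    Simplified: header names are matched case-insensitively and the value
    is split on commas; user-agent-prefixed values are not handled.
    """
    for name, value in headers.items():
        if name.lower() != "x-robots-tag":
            continue
        directives = [part.strip().lower() for part in value.split(",")]
        if "noindex" in directives or "none" in directives:
            return True
    return False


# Example header sets, written by hand rather than fetched from a server.
with_header = {"Content-Type": "application/pdf", "X-Robots-Tag": "noindex"}
without_header = {"Content-Type": "application/pdf"}

print(has_noindex(with_header))
print(has_noindex(without_header))
```

In practice the headers would come from a HEAD or GET request against one of the files in the affected directories, for example via urllib.request or a browser's network inspector.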
