: How does "Noindex:" in robots.txt work? I ran across this article in my SEO news today. It seems to imply that you can you use Noindex: directives in addition to the standard Disallow:

Posted in: #Googlebot #Noindex #RobotsTxt #WebCrawlers

I ran across this article in my SEO news today. It seems to imply that you can you use Noindex: directives in addition to the standard Disallow: directives in robots.txt.

Disallow: /page-one.html
Noindex: /page-two.html

Seems like it would prevent search engines from crawling page one, and prevent them from indexing page two.

Is this robots.txt directive supported by Google and other search engines? Does it work? Is it documented?

10.01% popularity Vote Up Vote Down

: It depends on the type of certificate you buy. Common certificates only work for exactly one sub-domain. (example.com or www.example.com). You can pay extra for a wildcard certificate

@Eichhorn148

0 Comments

: If I already have server logs & Google Webmaster Tools, do I still need Analytics? I have a website where I have access to full server logs, which we download and analyze, and I've also set

@Eichhorn148

Posted in: #GoogleAnalytics #GoogleSearchConsole

1 Comments

: When data is contextual on time, will search engines penalize a site for providing crawlers with all of the data instead of the contextual data? I'm working on an eCommerce website where we

@Eichhorn148

Posted in: #Seo

1 Comments

: Htaccess ErrorDocument 500 not working I have a site whose .htaccess file contains: ErrorDocument 404 /errors/404.html ErrorDocument 500 /errors/500.html The 404 redirect works just fine, but when

@Eichhorn148

Posted in: #Htaccess #HttpCode500

2 Comments

Login to post a comment!

1 Comments

Sorted by latest first Latest Oldest Best

@BetL925

Here is what Google's John Mueller says about Noindex: in robots.txt:

We used to support the no-index directive in robots.txt
as an experimental feature.
But it's something that I wouldn't rely on.
And I don't think other search engines are using that at all.

deepcrawl.com has done some testing of the feature and discovered that:

It still works with Google
It does prevent URLs from appearing in the search index
URLs that have been noindexed in robots.txt are marked as such in Google Search Console

Given that Google calls the feature "experimental" and has not officially documented it, I wouldn't recommend using it. It sounds like even if it works today, that support could be removed at any time.

Instead, use robots meta tags that are well supported and documented to prevent indexing:

<meta name="robots" content="noindex" />

10% popularity Vote Up Vote Down

Feed

: How does "Noindex:" in robots.txt work? I ran across this article in my SEO news today. It seems to imply that you can you use Noindex: directives in addition to the standard Disallow:

More posts by @Eichhorn148

: It depends on the type of certificate you buy. Common certificates only work for exactly one sub-domain. (example.com or www.example.com). You can pay extra for a wildcard certificate

: If I already have server logs & Google Webmaster Tools, do I still need Analytics? I have a website where I have access to full server logs, which we download and analyze, and I've also set

: When data is contextual on time, will search engines penalize a site for providing crawlers with all of the data instead of the contextual data? I'm working on an eCommerce website where we

: Htaccess ErrorDocument 500 not working I have a site whose .htaccess file contains: ErrorDocument 404 /errors/404.html ErrorDocument 500 /errors/500.html The 404 redirect works just fine, but when

Login to post a comment!

1 Comments

Back to top | Use Dark Theme