: How to disallow indexing but allow crawling? In the front page of my website, I have some previews to articles (with a small introduction to them) that link to the full articles. I want to

Posted in: #DuplicateContent #Googlebot #Indexing #RobotsTxt #WebCrawlers

In the front page of my website, I have some previews to articles (with a small introduction to them) that link to the full articles.
I want to disallow the front page to prevent duplicate content. But if I do this (in robots.txt), would it still be crawled?

I mean, the full articles would be still reached by the crawler even though I disallowed the only page that links to them?

I don't want the webcrawler not to access the page and enter the links in them, but I just don't want it to save the information (that will be repeated in the full articles).

10.01% popularity Vote Up Vote Down

: How to make Google recognize language for a multilingual website? A few weeks ago I implemented a translation functionality for my company website. The website now offers content in french and

@Margaret670

Posted in: #Google #GoogleSearch #Html #Multilingual #Seo

2 Comments

: How to check that Google Analytics Tracking Code is firing on an iPad I am used to using the Firebug extension "Omnibug" with Firefox to check that Google Analytics Tracking Code is firing

@Margaret670

Posted in: #Firebug #GoogleAnalytics #Ipad #Safari #Tracking

4 Comments

: Why my domain redirect on Google Apps is returning 404? I have a configuration in the Google Apps Control Panel (dcc.securepaynet.net) to redirect tombrito.com to http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4499244H

@Margaret670

Posted in: #GoogleApps #Redirects

1 Comments

: Why would URLs submitted in Google Webmaster Tools drop to 0? Why would URLs submitted in Google Webmaster Tools drop to 0? It's a small site, only like 20 pages, I submitted the XML sitemap

@Margaret670

Posted in: #GoogleSearchConsole

1 Comments

Login to post a comment!

1 Comments

Sorted by latest first Latest Oldest Best

@Si4351233

That is what the robots meta tag is for, control per page for indexing and following.

I've come to prefer it over using robots.txt as it gives finer control.

For your page, you'd want noindex,follow for the setting. The robot will read the page, not index it, but follow all the links off the page.

<meta name="robots" content="noindex,follow" />

10% popularity Vote Up Vote Down

Feed

: How to disallow indexing but allow crawling? In the front page of my website, I have some previews to articles (with a small introduction to them) that link to the full articles. I want to

More posts by @Margaret670

: How to make Google recognize language for a multilingual website? A few weeks ago I implemented a translation functionality for my company website. The website now offers content in french and

: How to check that Google Analytics Tracking Code is firing on an iPad I am used to using the Firebug extension "Omnibug" with Firefox to check that Google Analytics Tracking Code is firing

: Why my domain redirect on Google Apps is returning 404? I have a configuration in the Google Apps Control Panel (dcc.securepaynet.net) to redirect tombrito.com to http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4499244H

: Why would URLs submitted in Google Webmaster Tools drop to 0? Why would URLs submitted in Google Webmaster Tools drop to 0? It's a small site, only like 20 pages, I submitted the XML sitemap

Login to post a comment!

1 Comments

Back to top | Use Dark Theme