
How to disallow indexing but allow crawling?

@Margaret670

Posted in: #DuplicateContent #Googlebot #Indexing #RobotsTxt #WebCrawlers

On the front page of my website, I have previews of articles (each with a short introduction) that link to the full articles.
I want to keep the front page out of the index to prevent duplicate content. But if I disallow it in robots.txt, would it still be crawled?

I mean, would the crawler still reach the full articles even though I disallowed the only page that links to them?

I do want the crawler to access the page and follow the links on it; I just don't want it to index the content (which is repeated in the full articles).


1 Comment


@Si4351233

That is what the robots meta tag is for: per-page control over indexing and link following.

I've come to prefer it over robots.txt because it gives finer control.

For your page, you'd want noindex,follow. The robot will read the page and not index it, but it will still follow the links on the page. Don't also disallow the page in robots.txt, though: a crawler that is blocked from fetching the page never sees the meta tag or the links.

<meta name="robots" content="noindex,follow" />
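If you want to double-check that the tag actually made it into the rendered page, something like the following could work. It is only a rough sketch using Python's standard `html.parser`; the names `RobotsMetaParser` and `robots_directives` and the sample `page` string are made up for illustration and are not part of any crawler's tooling.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <meta ... />.
        if tag != "meta":
            return
        attr_map = {name.lower(): (value or "") for name, value in attrs}
        if attr_map.get("name", "").strip().lower() != "robots":
            return
        # The content attribute is a comma-separated list of directives.
        for part in attr_map.get("content", "").split(","):
            part = part.strip().lower()
            if part:
                self.directives.add(part)


def robots_directives(html):
    """Return the set of robots meta directives found in an HTML string."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


# Hypothetical front page markup, used only as sample input.
page = (
    "<html><head>"
    '<meta name="robots" content="noindex,follow" />'
    "</head><body></body></html>"
)

directives = robots_directives(page)
print(sorted(directives))  # ['follow', 'noindex']
```

This only confirms what the HTML says; how a given search engine acts on the directives is up to that engine.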
