Disallow Crawling of All Search Pages Using robots.txt
Using robots.txt, I am attempting to stop all crawling of our search URLs.
Disallow: /rest_of_url/search&tour*
Above is what I am using. Our URL looks like the following for all search results; however, everything after search&tour can be different, for example:
www.example.com.au/rest_of_url/search&tour-sdfs=the-palce+lcation+&tour-duration=1/
Will the Disallow code above stop robots from crawling all of my search result pages?
Will the Disallow code above stop robots from crawling all of my search result pages?
Yes, it will stop the (good) bots that obey the robots.txt "standard".
However, you don't need the trailing *. robots.txt uses prefix matching, so the "wildcard" * at the end can simply be omitted. (Wildcard-type matches are an extension of the original standard anyway.)
And you obviously need a User-agent directive to precede this rule, if you haven't got one already:
User-agent: *
Disallow: /rest_of_url/search&tour
Disallow sets the files or folders that are not allowed to be crawled.
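If you want to sanity-check the rule before deploying it, here is a minimal sketch using Python's standard urllib.robotparser (it implements the original prefix-matching behaviour and does not understand the * extension, so test without it); the rule and the sample URL are the ones from the question:

import urllib.robotparser

# Feed the proposed rules straight to the parser (no need to fetch a live robots.txt).
rules = [
    "User-agent: *",
    "Disallow: /rest_of_url/search&tour",
]
parser = urllib.robotparser.RobotFileParser()
parser.parse(rules)

# The sample search URL from the question should be reported as blocked.
url = ("http://www.example.com.au/rest_of_url/"
       "search&tour-sdfs=the-palce+lcation+&tour-duration=1/")
print(parser.can_fetch("Googlebot", url))  # False -> crawling is disallowed

Because the match is a simple prefix match, the same rule covers every URL that begins with /rest_of_url/search&tour, whatever follows it.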
In addition, you can prevent a page from appearing in Google Search by including a noindex meta tag in the page's HTML code. When Googlebot next crawls that page, Googlebot will see the noindex meta tag and will drop that page entirely from Google Search results, regardless of whether other sites link to it.
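The tag goes in the page's head section and looks like this:

<meta name="robots" content="noindex">

Bear in mind that Googlebot can only see the noindex tag on pages it is allowed to crawl, so for a given URL use either the robots.txt Disallow or the noindex approach, not both.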