What is the proper process to disallow a site from Google crawl?
I have googled and found two solutions for disallowing a whole site from
Google crawling.
1:
User-agent: *
Disallow:
2:
User-agent: *
Disallow: /
Now, can anyone tell me which is the proper code to disallow a whole site from Google crawling?
To prevent your whole site from being crawled, use No. 2:
User-agent: *
Disallow: /
This blocks every URL from being crawled. The URL-path following the Disallow: directive is a prefix: if the requested URL starts with this URL-path, it is blocked.

The minimum URL-path you can have is / (your home page / document root); you can't have an empty path (as suggested in the comments). When you request example.com, the browser actually requests example.com/ to make the request valid. See my other answer for more information on the trailing slash.
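A quick way to check this prefix behaviour yourself is Python's standard urllib.robotparser module. This is only a sketch of mine, not something from the robots.txt spec, and the example.com URLs are placeholders:

from urllib.robotparser import RobotFileParser

# Parse the "block everything" rules from option 2.
parser = RobotFileParser()
parser.parse("User-agent: *\nDisallow: /".splitlines())

# Every URL path starts with "/", so every URL matches the prefix.
for url in ("https://example.com/",
            "https://example.com/page.html",
            "https://example.com/dir/file"):
    print(url, parser.can_fetch("*", url))  # False for all three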
Disallow: by itself (without a path) actually allows everything - the complete opposite!
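You can confirm the opposite behaviour with the same sketch by feeding it an empty Disallow: directive (same placeholder URLs as above):

from urllib.robotparser import RobotFileParser

# Parse the "allow everything" rules from option 1.
parser = RobotFileParser()
parser.parse("User-agent: *\nDisallow:".splitlines())

# An empty URL-path matches nothing, so every URL is allowed.
print(parser.can_fetch("*", "https://example.com/"))           # True
print(parser.can_fetch("*", "https://example.com/page.html"))  # True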
Reference: www.robotstxt.org/robotstxt.html