How do you disallow the root in robots.txt, but allow a subdirectory? Using robots.txt, how do you disallow the root of a site (http://www.example.com/) but allow a subdirectory (http://www.example.com/lessons/)?
You must list the Allow lines first, because the file is read on a first-match basis.
To evaluate if access to a URL is allowed, a robot must attempt to
match the paths in Allow and Disallow lines against the URL, in the
order they occur in the record. The first match found is used. If no
match is found, the default assumption is that the URL is allowed.
Reference: www.robotstxt.org/norobots-rfc.txt
Google provides a robots.txt testing tool in Webmaster Tools, and I always recommend testing your file with it. See "Test a site's robots.txt file" near the bottom of:
support.google.com/webmasters/bin/answer.py?hl=en&answer=156449
User-agent: *
Allow: /lessons/
Allow: /other-dir/
Disallow: /
This disallows the entire website while explicitly allowing the listed directories. (Note that the Allow lines come before Disallow: / — under first-match evaluation, putting Disallow: / first would block everything, including /lessons/.)
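If you want to sanity-check the ordering locally, Python's standard-library urllib.robotparser evaluates rules in order and uses the first match, like the robotstxt.org spec quoted above. A quick sketch (example.com is just the placeholder from the question):

```python
from urllib.robotparser import RobotFileParser

# Allow lines listed before the blanket Disallow, as required
# under first-match evaluation.
rules = """\
User-agent: *
Allow: /lessons/
Allow: /other-dir/
Disallow: /
"""

rp = RobotFileParser()
rp.parse(rules.splitlines())

print(rp.can_fetch("*", "http://www.example.com/"))          # False: root blocked
print(rp.can_fetch("*", "http://www.example.com/lessons/"))  # True: subdirectory allowed
```

Swapping the rules so Disallow: / comes first makes can_fetch return False for /lessons/ too, which is exactly the mistake the ordering advice guards against. (Googlebot itself uses most-specific-match rather than strict first-match, so Allow-first ordering is safe for both interpretations.)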