: Robots.txt and Regular Expressions If I place a regular expression in robots.txt will it be readable for the web crawlers?
If I place a regular expression in robots.txt will it be readable for the web crawlers?
More posts by @Ogunnowo487
2 Comments
Sorted by latest first Latest Oldest Best
While it's not part of the standard robots.txt protocol (and therefore not globally recognized), Google and Bing both support LIMITED pattern matching.
For example:
User-agent: Googlebot
Disallow: /private*/
Will block Googlebot from any directory beginning with "private". Currently, Googlebot supports * and $ (end of string).
Details on Googlebot's pattern matching: www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
No, the robots.txt syntax only supports wildcards, and not regular expressions. See here for a good discussion of the syntax: www.robotstxt.org/robotstxt.html.
Terms of Use Create Support ticket Your support tickets Stock Market News! © vmapp.org2024 All Rights reserved.