: Prevent crawler that doesn't honoring robots.txt I have some problem, when I try to write robots.txt for my site ... I find some issues by search on Google, and tell me about honor and not

Posted in: #Indexing #RobotsTxt #WebCrawlers

I have some problem, when I try to write robots.txt for my site ...

I find some issues by search on Google, and tell me about honor and not honoring robots.txt, how I can prevent it, can I perform it with .htaccess or other way ?

10.02% popularity Vote Up Vote Down

: Tuning WebServer Response - I have this sam e question on StackOverflow and I was advised to ask it here hoping for more information. Here is the question: I am in rather unfavorable situation.

@YK1175434

Posted in: #AmazonEc2 #Iis7

2 Comments

: Is having a 'home' navigation item on the home page negative to your sites SEO? Possible Duplicate: Do search engines penalise ‘Home’ links and/or buttons? My work colleague has

@YK1175434

Posted in: #Homepage #Links #Seo

4 Comments

: A technical pattern which includes existing functionalities.

@YK1175434

0 Comments

: Position of a site in results of search engines.

@YK1175434

0 Comments

Login to post a comment!

2 Comments

Sorted by latest first Latest Oldest Best

@Jessie594

Simple: Ban them all! With PHP and Regex. For example:

if (preg_match('/(?i)badbot1|badbot2|badbot3/',$_SERVER['HTTP_USER_AGENT'])){

header ('HTTP/1.1 403 Forbidden');
exit();
}

The header statement is optional

Be careful, never close the last "badbot" with a pipe "|". If you do, you ban all your traffic!
So, use "badbot1|badbot2|badbot3".

Never "|badbot1|badbot2|badbot3" and
Never "badbot1|badbot2|badbot3|"

Good luck

10% popularity Vote Up Vote Down

@Jessie594

If there are crawlers not following your robots.txt rules you will need to ban them by IP. Placing their user agent's into your robots.txt to ban does nothing if they aren't following it's rules.

10% popularity Vote Up Vote Down

Feed

: Prevent crawler that doesn't honoring robots.txt I have some problem, when I try to write robots.txt for my site ... I find some issues by search on Google, and tell me about honor and not

More posts by @YK1175434

: Tuning WebServer Response - I have this sam e question on StackOverflow and I was advised to ask it here hoping for more information. Here is the question: I am in rather unfavorable situation.

: Is having a 'home' navigation item on the home page negative to your sites SEO? Possible Duplicate: Do search engines penalise ‘Home’ links and/or buttons? My work colleague has

: A technical pattern which includes existing functionalities.

: Position of a site in results of search engines.

Login to post a comment!

2 Comments

Back to top | Use Dark Theme