Mobile app version of vmapp.org
Login or Join
Speyer207

: 404 Hits from "PHPCrawl" on WP site Forgive me if this is in the wrong area - I thought considering the topic, it belonged here (as opposed to in WP). Correct me or move it if I am wrong.

@Speyer207

Posted in: #UserAgent #Wordpress

Forgive me if this is in the wrong area - I thought considering the topic, it belonged here (as opposed to in WP). Correct me or move it if I am wrong.

I've been looking around one of the sites I control (one we've been having all kinds of little problems with) and checked my 404 logs. Usually I expect to see the generated captcha images and other random entries, but today I saw several of the site's pages on there with "/(" appended at the end, all with "PHPCrawl" as the User Agent.

As far as I can figure, this is an open source script available for developers to use at their discretion. I've not used anything of the sort. I don't believe anyone else in control of the site has.

Is it possible one of the plugins could have caused these entries? Is it automated from some search engine? Is it something I should be worried about hurting the site or its SEO?

10.01% popularity Vote Up Vote Down


Login to follow query

More posts by @Speyer207

1 Comments

Sorted by latest first Latest Oldest Best

 

@Gretchen104

Is it possible one of the plugins could have caused these entries?


Highly unlikely unless one of your plugins is attempting to index the site. But the plugins have access to the wp-posts table so there isn't any need to spider via the front end.


Is it automated from some search engine?


Potentially, but not one of the major ones as those Spiders are easily and readily identifiable in your logs. I can't think of any legitimate search engine that would use PHPCrawl. You should start tracing the IP addresses that are causing the 404 errors.


Is it something I should be worried about hurting the site or its SEO?


There's not really enough information to answer this definitively. Someone is crawling your site via a script but there's no way to know why or to what purpose they will use the data. Should you be worried? Probably not...this kind of stuff happens all the time. If the IP address proves to be from an area that you are not concerned about, add a deny line to your .htaccess file and block them. Will it affect SEO? Again, most likely not. It is remotely possible that someone is spidering pages to create a spammy page of links but careful checking of your analytics will reveal any weird backlinks and you can then handle those easily enough.

tl;dr

The situation bears watching but is unlikely to be a problem.

10% popularity Vote Up Vote Down


Back to top | Use Dark Theme