: Suspicious crawler activity I'm noticing that I get accesses 66.249.66.198 - - [01/Jul/2011:17:13:46 +0200] "GET /img/clip.incubus.torrent.phtml HTTP/1.1" 404 143 "-" "Mozilla/5.0 (compatible; Googlebot/2.1;
I'm noticing that I get accesses
66.249.66.198 - - [01/Jul/2011:17:13:46 +0200] "GET /img/clip.incubus.torrent.phtml HTTP/1.1" 404 143 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.66.198 - - [01/Jul/2011:17:13:48 +0200] "GET /img/clip.global.deejays.download.phtml HTTP/1.1" 404 143 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
that files don't exist and there is no file on my site that has this content (I hope).
Why is googlebot trying out these links?
reverse dns and whois state that 66.249.66.198 is really googlebot.
More posts by @Sent6035632
2 Comments
Sorted by latest first Latest Oldest Best
If domain name is new (you recently bought it) then Google may still try to get old URLs (not because it remembers those links (it will de-index such not-working-for-a-long-time links) but because there is still a site that has links to this domain)
Wherever those links were published there maybe a typo in URL (domain name) and they are pointing to your domain (e.g. hellokitten.com instead of hellkitten.com).
I very doubt that it is Google checking for malware using SO many different URLs which look very similarly.
The URLs you showed so far suggest they are probably related to a torrent site. As we know Google does not like torrent sites any more. Maybe it had some suspicion (don't know what it may be) and checks your site for some common URLs that it founds on other sites.
Terms of Use Create Support ticket Your support tickets Stock Market News! © vmapp.org2024 All Rights reserved.