dis They part from a known resource (in the

I'm seeing in my access log a number of request very suspicious:

/i
/im
/imaa
/imag
/image
/images
/images/d
/images/di
/images/dis

They part from a known resource (in the above example /images/disrupt.jpg).

All comming from same IP. Requests varies from 1/sec to 10/sec, seems somewhat random.
It's obviously they are trying to find something and seems they are using a script.

How do I block this kind of behaviour? I though of blocking the IP request, at least for a given time.
Keeping in mind that:

Request intervals seems legitimate (at least I think so).
I don't want to end blocking a search engine bot, which may find 404 urls too (and that's a different problem, I know). ¿Do they use always same IP?

10.02% popularity Vote Up Vote Down

: Reach Local Proxy Page - Duplicate content? We have a client who has instructed Reach Local to manage their paid SEO work etc. RL have created a proxy version of the page at http://example-px.rtrk.co.uk

@Carla537

Posted in: #CanonicalUrl #DuplicateContent

1 Comments

: Moving a domain and staging url I have a website with web hosting I am planning on switching away from (Windows Server). So I got a hosting plan with a linux server and I was going to

@Carla537

Posted in: #Domains #Subdomain

1 Comments

: Real Time Push Server for Media I am currently using a real time Push server to push text and small binary data. I am looking for a similar service for Media. Pubnub, which I currently use,

@Carla537

Posted in: #Images #Media

1 Comments

: Making a site available at http and https after installing an SSL certificate I just had hostgator install an SSL certificate on my site. As a result, my site is only available (right now)

@Carla537

Posted in: #Http #Https

1 Comments

Login to post a comment!

2 Comments

Sorted by latest first Latest Oldest Best

@Carla537

Finally I found who was the responsible, it was a javascript that tried to load the resources in real time as somebody write an article.
As the user was typing the url of an image, the script tried to load it even if the path was not complete, hence that 404 logs.

10% popularity Vote Up Vote Down

@XinRu657

Do they use always same IP?

No, search engines can be expected to use a variety of IP addresses - but they do always use the same autonomous system (and all the major search engines have their own AS).

If you have the IP address, you can go to ARIN and use the "WHOIS Search" at the upper right-hand corner of the page to look up the autonomous system associated with the IP address.

10% popularity Vote Up Vote Down

Feed

: Blocking path scanning I'm seeing in my access log a number of request very suspicious: /i /im /imaa /imag /image /images /images/d /images/di /images/dis They part from a known resource (in the

More posts by @Carla537

: Reach Local Proxy Page - Duplicate content? We have a client who has instructed Reach Local to manage their paid SEO work etc. RL have created a proxy version of the page at http://example-px.rtrk.co.uk

: Moving a domain and staging url I have a website with web hosting I am planning on switching away from (Windows Server). So I got a hosting plan with a linux server and I was going to

: Real Time Push Server for Media I am currently using a real time Push server to push text and small binary data. I am looking for a similar service for Media. Pubnub, which I currently use,

: Making a site available at http and https after installing an SSL certificate I just had hostgator install an SSL certificate on my site. As a result, my site is only available (right now)

Login to post a comment!

2 Comments

Back to top | Use Dark Theme