Mobile app version of vmapp.org
Login or Join
Miguel251

: Another website is mirroring and ranks above my site in search results There is a site of ill-repute known as thedirty which has completely mirrored my site and now has links appearing on Google

@Miguel251

Posted in: #Google #GoogleSearch #Mirror #Seo

There is a site of ill-repute known as thedirty which has completely mirrored my site and now has links appearing on Google at the #1 spot using my content. I checked my log files and noticed that this site has been crawling mine for sometime, and also has 10,000 links from their site to mine.

I have blocked user access which is referred from this site and reported them as web spam to Google already. I also disavowed the domain.

How are they getting top links in Google (even overtaking mine) for such nefarious tactics? What are the steps to completely eliminating an issue such as this?

UPDATE 8/28/2014:

I thought I would provide an update on this as I have more information now. So thedirty pointed their subdomains to my ip which had the effect of making their subdomains look like my website.

For a couple of days this didn't matter much because using htaccess, I redirected all HOSTS not of my domain, back to my domain which basically meant I was getting their subdomains traffic links on Google. After a couple of days thedirty changed their subdomains to point back to their website so that I no longer benefit from this.

So the whole point being they used my content to get top ranks on Google, and are now pointing those links back to their website to drive more traffic to theirs.

It is a dirty tactic by a dirty website. My hope is that Google punishes such behavior.

10.06% popularity Vote Up Vote Down


Login to follow query

More posts by @Miguel251

6 Comments

Sorted by latest first Latest Oldest Best

 

@Sue5673885

This is called a Google Proxy Hack, and it happened to me as well.

First things first:


Submit a DMCA complaint to the Web Host. Use this link to create a correctly formed complaint, and send it to the host's support or abuse email. If the host is in the US, they must take down the site. Even if they are not based in the US, they may choose to take down the site anyway. (That happened to me once.)
Use Google DMCA tool to request the mirrored URL's be removed from its search results.
Use Google's Scraper Report to report the failure in Google's algorithm.


Fundamentally though, this a failure on Google's part. For all that they say about ranking being based on "quality original content", this is an absurdly simple counter-example that quite frankly is just embarrassing.

Hopefully if enough people complain about it, eventually Google will get its act together and write the 10 lines of code it takes to check that a site is an exact mirror of a previously established site.

Also, be aware that using canonical URLs do not always work in this instance. Many of these proxy scripts change the canonical URLs to point to the mirror site, thus rendering them useless.

Finally, be aware that they may have also spammed your main site with garbage links in order to damage your rankings. (This happened to me as well.)

If you do some searching and creative thinking, there are some ways to fight back. I really don't think it's a good idea to publish a complete list here because that just makes the hackers' lives easier.

10% popularity Vote Up Vote Down


 

@Lengel546

Like others have mentioned, filing a DMCA complaint and a copyright civil law suit may be the best options.

For the new content that you publish, you can consider notifying about your site updates on social media (Twitter, Facebook etc.) as soon as you post. The timestamp recorded there can be a fair indicator that you wrote first in case you have to prove it. Through Google Webmaster Tools, you can enforce a setting to have Googlebot crawl your site often. Assuming, popular search engines index your web-pages regularly (use the site operator, site:example.com, to find out) the date in the cached copy can be used as a rough indicator of when the content was published.

Also for the new content that you publish, you can embed watermarks within images and place comments within JS files indicating that you are the original owner of those files.

10% popularity Vote Up Vote Down


 

@Chiappetta492

A little late for you but best idea to protect your website (in the future) would be this:

(defcon 21, defense by numbers)
faking the return code so users will see the content but bots will


throw the content away
crawl in circles
stop working


other possible ideas - make sure that your users don't see any of this:


let them save GB of information (while there are only a few kb on your server)
make the bots flood their own memory with fake links
send fake content (100% boolsh*t - you need to write stuff - like
"Obama pregnant", "Spider-Man 5 - next summer", ... so your thieves can host
it...)
send fake files (like 42.zip, if they do not check the copied content their users will have fun --> AV tools will show that something is wrong --> users will be p*ss*d...)
let them wait for more data (file size = 1-10 MB and send random cr*p with 1 Byte/s or less)


other ideas:


links protected by Javascript (old, no use anymore? but if they remain unchanged users will be send to you (for a while))
dynamic garbage (use comments or invisble items to make the bots download stuff users can't see - good bots won't fall for this)
block ip adresses that download too much / too fast / the wrong way (bots do not behave like humans 1) every link on each page 2) there is a pattern or total chaos in the way they choose the next link(s))
use Javascript to redirect to your server if the files are not hosted by your server (no help against theft but the thieves have to remove it or their users won't stay on their page - you could code it into different routines (like content decryption))

10% popularity Vote Up Vote Down


 

@Heady270

You could track their IP (or IPs) and return totally different content for them to mirror - whatever you like. This way you get free space for advertising whatever, and you can use their high position in Google to your advantage.

I once used this to simply explain to the users on the mirrored website that this is the wrong domain. You can also post a simple HTTP redirect header.

10% popularity Vote Up Vote Down


 

@Ann8826881

You can file a DMCA complaint and if you are in the U.S., you can file a copyright civil law suit.

Here is a link to a short answer that explains how the DMCA complaint can help anyone:

Do you have to be in the United States to file a DMCA complaint?

... and another one the explains more...

How much of your content needs to be copied before you can file a DMCA complaint?

If you are in the U.S., you can hire a lawyer that is familiar with copyright issues and have them send a cease and desist letter. Give them 10 days (real days and not business days though business days would be fine too) to remove the content. You want to capture snap-shots of the offending site for your evidence and snap-shots of your site as well. If you check to see if a page has been removed, check the site directly and not search results.

If the page has not been removed within the time period, you can file a federal civil case that will take at least ,000 to defend and hardly cost you anything. You will be in the drivers seat. It is likely that a settlement can be had for at least ,000 and possibly more. You can get your costs back too. What is important is to offer a no-cost option to rectify the problem hence the cease and desist letter. After that, you are clear to file a case without responsibility to the defendant.

One other note is that you will need to demonstrate damages if you go to court. Loss of search traffic is damages. Here you will work with your attorney to collect metrics that illustrate loss of traffic and you will need to monetize the value of that traffic. Of course you can assume higher numbers here even with a %100 conversion rate. Just in case, I would be collecting metrics on traffic loss using Google Analytics and your log file analysis software today and into the future.

Please know that filing a case is not difficult or very expensive especially compared to the damages you are experiencing now and into the future. Copyright infringements have been going down lately, but those few that violate copyright are much bolder these days. We need to stop these people and the only real way is to put a cost factor into the business strategy that makes copyright infringement non-profitable.

10% popularity Vote Up Vote Down


 

@Kevin317

If they're just mirroring your site by feeding your site through a proxy script or regurgitating your HTML verbatum, you can add canonical URLs to your pages. This will let Google know your content is the original source and to show your URL in the search results, not their's.

Submit a DMCA request to Google. They're a little slow with them but they will ultimately remove those pages from their index.

Disavowing the links is a smart move.

I don't know if blocking the users is helpful though. Maybe putting a message at the top of your pages for them letting them know you are the original site and the other is a fraud might be a better solution.

10% popularity Vote Up Vote Down


Back to top | Use Dark Theme