Mobile app version of vmapp.org
Login or Join
Gretchen104

: Is this a proper way to deal with near duplicate content? I have a website which used to have 22.000 pages with near duplicate content. Of course, this has become a Panda issue. These pages

@Gretchen104

Posted in: #CanonicalUrl #DuplicateContent #Indexing

I have a website which used to have 22.000 pages with near duplicate content. Of course, this has become a Panda issue. These pages are now gone from my site and the Google index.

I have decided to simplify the 22.000 pages into a unique page with a query parameter which can take 22.000 values. Due to some category pages and a proper link structure, Google can reach this page in 22.000 different ways.

Of course, this still makes 22.000 different URLs. In order to solve this issue, I have set the page with a canonical URL.

I don't want to spam Google index, but I need to server my customers too.

My question is: can someone confirm that making my unique page - called 22.000 times with different parameter values - canonical is a proper solution to my near duplicate content issue? If not, how should I solve it?

Update

Here is what I mean by being called 22.000 times:
www.mysite.com/product_descr.html?pid=0 http://www.mysite.com/product_descr.html?pid=1 www.mysite.com/product_descr.html?pid=2 http://www.mysite.com/product_descr.html?pid=3 www.mysite.com/product_descr.html?pid=4 ...


In the product_descr.html page, I have:

<link rel="canonical" href="http://www.mysite.com/product_descr.html"/>


Update II

I found some Google documentation confirming that canonical is a solution.

10.01% popularity Vote Up Vote Down


Login to follow query

More posts by @Gretchen104

1 Comments

Sorted by latest first Latest Oldest Best

 

@Sarah324

The solution you're about to implement, if that can be called a solution, will work.

But before you actually implement or publish it in your website, let me give you some advice (I have a site with 90k indexed pages, and when it was about 10k, we had the same exact problem).

First of all, leave your old pages accessible and 301 redirected to your new pages

Just as important, Inside Webmaster Tools set pid parameter as Parameter that change or determine the content of a page. You'll get explained how to do it here. Do this even before publishing that changes to the production server.

Another thing you can do, which I strongly recommend you, is to build friendly and descriptive urls to solve this. Use your product name as part of the URL. If your in php & apache, you can use .htaccess or programmatically to reach it.

My last advice will be to visit Webmaster Tools for your site everyday, to check for 404 links in Google SERPs and the indexing rate.

10% popularity Vote Up Vote Down


Back to top | Use Dark Theme