Mobile app version of vmapp.org
Login or Join
Ravi8258870

: Why is Google indexing pages with ?utm_campaign query string Recently I have noticed that Google is indexing URLs containing utm_campaign, utm_source and utm_medium query string arguments. In the

@Ravi8258870

Posted in: #CanonicalUrl #Google #Serps

Recently I have noticed that Google is indexing URLs containing utm_campaign, utm_source and utm_medium query string arguments. In the results, Google shows URLs with these query strings rather than the canonical URL:



I understand this could be a 'duplicate content' issue, but I have been using the link rel=canonical tag throughout my website. As one example:

[snip]
<meta name="description" content="App store optimization helps discovery and rank on Google Play and the App Store. Publishers can use ASO by following these simple steps."/>
<meta name="robots" content="noodp"/>
<link rel="canonical" href="https://sweetpricing.com/blog/2016/11/app-store-optimization/" />
<meta property="og:locale" content="en_US" />
[snip]




My expectation is that Google should be using the canonical URL for indexing. What am I doing wrong?

10.03% popularity Vote Up Vote Down


Login to follow query

More posts by @Ravi8258870

3 Comments

Sorted by latest first Latest Oldest Best

 

@Angela700

Reviewing your website as it is now, I am not too sure if this is a problem any longer / currently.

The issue is not internal linkage on your website with the inclusion of UTM parameters (as another question suggests).

It seems more like some process you have to share your website content on social media is leaving the UTM parameters in the URLs and sharing those URLs which has, at some point, led to them being indexed.

It is rare that this happens, but it has happened to many other sites before. The fact that it is only three pages indexed with these parameters is indicative that this is neither a serious issue nor a sitewide one.

Here are the steps you can take to help eradicate this from happening:-

1. Specify a canonical URL on your pages

You are already doing this and the implementation is correct. This will ensure only the specified canonical URL will be given weight in search engines. Presumably this has always been in place but if not, then this could explain why there are some old instances of pages still indexed with UTM parameters.



2. Instruct Google not to index the UTM parameters in Search Console

In the event that some URLs are being indexed with the UTM parameters (like your case), the URL parameter should appear as a detected one from within the 'Crawl > URL Parameters' section of Google Search Console for your domain (see below).



Even if the UTM parameters do not appear, you can 'Add Parameter' to create them.

Simply select No: Doesn't affect page content (ex: tracks usage) (known as 'Passive Parameters') and Google will then usually only crawl just one URL with a specific parameter value.

3 Disallow the URL parameters in your robots.txt

This will block Google from indexing the content of these URLs but not the actual URLs themselves (they could still display in the search results but will just omit the description like below).



Simply adding something like the following would handle this from robots.txt:-

Disallow: /*?utm=*


Conclusion

Steps #1 and #2 should be carried out as a matter of precaution and "best practice" anyhow and step #3 in addition to steps #1 and #2 perhaps (as won't be effective on its own).

Within Google Search Console, there is also the ability to (temporarily) remove URLs. This is particularly useful if there are some stubborn pages still indexed but you know the root source of the issue has been resolved and this facility should be enough to rid of them once and for all from the search results.

I have not included this as a step above as, despite having researched this before, I cannot recall whether it will support URLs with parameters [citation needed]. I once knew the answer but my memory fails me on this particular occasion.

More reading on the removal of URLs from Google.

10% popularity Vote Up Vote Down


 

@Goswami781

It seems that you are using these links inside the content of your website to link pages together.

To make sure Google won't index, you can add rel="nofollow" to these links inside your website and block these params from robots.txt file:

Disallow : /*utm_campaign

10% popularity Vote Up Vote Down


 

@Bryan171

Have you checked that is your canonical URL has been indexed or not?
If the canonical URL has been indexed then there is nothing to worry about.

You can try Google's Webmaster Tools and change how Google handles URL parameters here.

10% popularity Vote Up Vote Down


Back to top | Use Dark Theme