Will adding nofollow to alternate language links prevent Googlebot from crawling so many of them?

@Correia994

Posted in: #Hreflang #Language #Seo

I have a site with many pages (30MM+ indexed) and have been getting the "extremely high number of URLs on your site" warning message for a long time. I'm thinking of reducing the number of URLs Google sees, especially among similar pages: specifically, pages that use a different language template but have the same content.

Currently, every page has this in its <head>:

<link rel="alternate" hreflang="en x-default" href="http://mydomain.com/dir/page" />
<link rel="alternate" hreflang="ja" href="http://ja.mydomain.com/dir/page" />
<link rel="alternate" hreflang="ko" href="http://ko.mydomain.com/dir/page" />
<link rel="alternate" hreflang="th" href="http://th.mydomain.com/dir/page" />
<link rel="alternate" hreflang="es" href="http://es.mydomain.com/dir/page" />
<link rel="alternate" hreflang="pt" href="http://pt.mydomain.com/dir/page" />


Will adding rel=nofollow or a robots.txt disallow help Google focus on the more important pages it has to crawl, instead of crawling a different template of the same page?


@BetL925

Adding nofollow generally does not prevent Googlebot from discovering content. See: Does a "nofollow" attribute on a link prevent URL discovery by search engines?

If a nofollow link is the only link to a page, then Google won't index that page. In your case, I highly doubt that your alternate language links are the only links into your language subdomains. Once Google finds a followed link to any page on an alternate language subdomain, it can discover the rest through the followed links there.
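To make the discovery argument concrete, here is a minimal sketch in Python. It is a toy model, not a description of how Googlebot actually works: it assumes a crawler that simply ignores nofollow links and follows everything else. The `discoverable` function and the small link graph (including the `/about` page) are hypothetical and exist only for illustration.

```python
from collections import deque

def discoverable(links, start):
    """Return the set of pages reachable from `start` via followed links only.

    `links` maps a page to a list of (target, nofollow) pairs.
    Links flagged nofollow are skipped, as in the toy crawler assumed here.
    """
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target, nofollow in links.get(page, []):
            if nofollow or target in seen:
                continue
            seen.add(target)
            queue.append(target)
    return seen

# Hypothetical link graph: the alternate language link is nofollow,
# but one ordinary followed link leads into the ja subdomain.
links = {
    "mydomain.com/dir/page": [
        ("ja.mydomain.com/dir/page", True),   # alternate link, nofollow
        ("mydomain.com/about", False),
    ],
    "mydomain.com/about": [
        ("ja.mydomain.com/", False),          # a single followed link
    ],
    "ja.mydomain.com/": [
        ("ja.mydomain.com/dir/page", False),
    ],
}

found = discoverable(links, "mydomain.com/dir/page")
print("ja.mydomain.com/dir/page" in found)
```

In this toy graph the Japanese page is still found, because a single followed path into the subdomain is enough. It would stay hidden only if the nofollow link were the sole route to it.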

The "extremely high number of URLs on your site" warning from Webmaster Tools is nothing to worry about if your site is supposed to have that many pages. Google is more worried about cases such as:


Sites with session IDs in the URL, which make the site appear to have a different URL every time Google crawls it
Sites that can auto-generate a page for any URL
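As an illustration of the session ID case, the sketch below shows how two URLs that differ only in a session parameter collapse to one address once that parameter is stripped. The parameter names in `SESSION_PARAMS` and the `canonical_url` helper are assumptions made for this example; a real site would use whatever names its own framework emits.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Hypothetical session parameter names, chosen for illustration only.
SESSION_PARAMS = {"sid", "sessionid", "phpsessid"}

def canonical_url(url):
    """Return `url` with any session-style query parameters removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in SESSION_PARAMS
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )

# Two crawls of the same page, each handed a different session ID.
a = canonical_url("http://mydomain.com/dir/page?sid=abc123&lang=en")
b = canonical_url("http://mydomain.com/dir/page?sid=zzz999&lang=en")
print(a == b)
```

Without this kind of normalization, every crawl would appear to find a brand new URL for the same content, which is the situation the warning is aimed at.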


Every site with millions of pages gets this message in Webmaster Tools. There is no problem with a site having that many pages if it has the content to back it up. I've worked with several such sites myself. None of them have had trouble with rankings because of it.
