Mobile app version of vmapp.org
Login or Join
Shakeerah822

: Google indexing titles of login only pages Google is indexing some pages which require login on our site, but just the title of each page. I am quite baffled as to how this can happen, Has

@Shakeerah822

Posted in: #Google

Google is indexing some pages which require login on our site, but just the title of each page.

I am quite baffled as to how this can happen, Has anyone experienced something similar?

One thing that comes to my mind is Google Analytics js sending the title of the page to Google, but that doesn't make sense.

We are also using some other 3rd party like optimizely, which might be causing this.

Any ideas what might be causing Google to index the titles of these pages?

10.02% popularity Vote Up Vote Down


Login to follow query

More posts by @Shakeerah822

2 Comments

Sorted by latest first Latest Oldest Best

 

@Eichhorn148

If the pages in question are in robots.txt, Google still may index the pages but use the anchor text of any inbound links as the page title. My guess is that you have links to these pages that have anchor text that is identical or similar to the page titles. Clearly if Google can't crawl the page they can't get the page contents, so they aren't going to index that.

If you don't want these pages to appear in search results you actually have to let Googlebot crawl them and return the appropriate meta robots tag

<meta name="robots" content="noindex, nofollow">


Googlebot would also be unlikely to index these URLs if they redirect to your login page and Googlebot can see that by crawling them. Especially if your login page has the robots noindex meta tag.

10% popularity Vote Up Vote Down


 

@Turnbaugh106

This is purely down to the way your site is operating, the titles are being processed without requiring authentication. As long as the rest of the page is not rendering then its not a serious problem and since Google isn't indexing the contents of the page then its a good sign that this isn't the case.

The mistake here is that you have not blocked the administrator/private areas of the site.

You should use both of these methods to stop Google even attempting to index these pages,

On all administrator/private pages use:

<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">


Additional a extra layer of protection against indexing is using the robots.txt, you can block Google and others indexing these pages by using:

User-agent: *
Disallow: /administrator/

10% popularity Vote Up Vote Down


Back to top | Use Dark Theme