: Google indexing titles of login only pages Google is indexing some pages which require login on our site, but just the title of each page. I am quite baffled as to how this can happen, Has
Google is indexing some pages which require login on our site, but just the title of each page.
I am quite baffled as to how this can happen, Has anyone experienced something similar?
One thing that comes to my mind is Google Analytics js sending the title of the page to Google, but that doesn't make sense.
We are also using some other 3rd party like optimizely, which might be causing this.
Any ideas what might be causing Google to index the titles of these pages?
More posts by @Shakeerah822
2 Comments
Sorted by latest first Latest Oldest Best
If the pages in question are in robots.txt, Google still may index the pages but use the anchor text of any inbound links as the page title. My guess is that you have links to these pages that have anchor text that is identical or similar to the page titles. Clearly if Google can't crawl the page they can't get the page contents, so they aren't going to index that.
If you don't want these pages to appear in search results you actually have to let Googlebot crawl them and return the appropriate meta robots tag
<meta name="robots" content="noindex, nofollow">
Googlebot would also be unlikely to index these URLs if they redirect to your login page and Googlebot can see that by crawling them. Especially if your login page has the robots noindex meta tag.
This is purely down to the way your site is operating, the titles are being processed without requiring authentication. As long as the rest of the page is not rendering then its not a serious problem and since Google isn't indexing the contents of the page then its a good sign that this isn't the case.
The mistake here is that you have not blocked the administrator/private areas of the site.
You should use both of these methods to stop Google even attempting to index these pages,
On all administrator/private pages use:
<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">
Additional a extra layer of protection against indexing is using the robots.txt, you can block Google and others indexing these pages by using:
User-agent: *
Disallow: /administrator/
Terms of Use Create Support ticket Your support tickets Stock Market News! © vmapp.org2024 All Rights reserved.