: Hide content from search engines (legitimate, not SEO related) I have a website which contains famous and original (made by users) quotes, poetries, and such. Recently many of our users start
I have a website which contains famous and original (made by users) quotes, poetries, and such. Recently many of our users start to complain because their name is searchable on Google since we have a very good indexing of the website (for every positive thing there is a negative side...)
Now we want to offer to users a "privacy" mode in the website, while retaining indexing for most pages: their name won't be indexed but only visible to users, while their texts will be both visible and indexed. Follow me: this isn't a SEO related question, this is legitimate hiding. We don't want people to be searchable on search engines, but only their texts. Why we want texts to be searchable? Because many use our website as "central repository" of their texts, which are copied by others on Facebook without retaining the author name: people searching for "anonymous" content have to find their page on our website and see the author name there.
Obviously the author profile would be completely blocked with meta tags, the only pages where this is relevant are those by topic, like this: www.pensieriparole.it/aforismi/politica/pag1 and other "general" pages like those containing latest insertions: www.pensieriparole.it/poesie/
What do you think is the best way to hide these names?
I heard recently Google is actually rendering some js, and using a document.write directly on the page is some sort of cloaking as far as I can tell. So i thought about an iframe with the author name as document fragment (#Author+Name) and a little js that does document.write there. This way I can block that iframe from being crawled with robots.txt, while allowing browsers to cache it, no matter which name it has to print, and ask for it only the first time.
More posts by @Margaret670
3 Comments
Sorted by latest first Latest Oldest Best
This should do the Trick:
<!--googleoff: index-->
<p>hide me!</p>
<!--googleon: index-->
For more information check out the link to Googles page that describes it in more depth.
Excluding Unwanted Text from the Index
Basically you want to hide pieces of content from google right?
Indeed google has some hability to crawl javascript, so adding your content through javascript is not google bot proof, I don't think anything short of an image will be 100% google proof, it's worth checking if the image is a viable option since a jpg with caching maybe demand very little from your bandwidth.
other then that I can think of two solutions, but I'm not 100% sure of both:
you could serve different pages to crawler bots (checking the user agent) and hiding the name of the author, but it will probably hurt you as its a common cloaking technique.
you could load the names on iframes , google does crawl them, but they are not recommended by them (google faq on iframes ), then you could set the noindex and/or the robots.txt to hide the iframe pages from google.
You could simply render the author name in an image which would achieve what you need to (provided you obfuscate the filename.jpg and block image crawlers) without a great deal of disruption and work.
Terms of Use Create Support ticket Your support tickets Stock Market News! © vmapp.org2024 All Rights reserved.