Google keeps indexing /comment/reply URLs
With the new update to Google's algorithm, called Penguin, I think my site is being penalized for webspam. But of course I don't create posts that look like spam to Google; I think it is just how Google indexes my site.
I found that Google indexes URLs on my site like:
www.example.com/comment/reply/3866/26556
So there are many comment/reply URLs indexed by Google. I have already added:
Disallow: /comment/reply/
Disallow: /?q=comment/reply/
but Google still indexes these URLs.
Any idea how to prevent Google from indexing comments?
4 Comments
Using Disallow in your robots.txt file will not stop Google from indexing those links or pages; it only tells Google not to crawl them.
If those pages are linked to from other pages on your domain, Google may still index them.
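A more reliable way to keep already-linked pages out of the results is to serve a noindex directive. A minimal sketch, assuming an Apache server with mod_headers enabled (the /comment/reply/ pattern matches this site's URLs; adjust it for your own scheme):

```
# Apache sketch: send an X-Robots-Tag header for comment reply URLs
# (assumes mod_headers is enabled)
<LocationMatch "^/comment/reply/">
    Header set X-Robots-Tag "noindex"
</LocationMatch>
```

Note that for this to work, the URLs must not also be disallowed in robots.txt: Googlebot has to be able to fetch the page in order to see the header.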
You haven't mentioned how long ago you added those Disallow rules. The effect isn't instantaneous: at the very least you have to wait until your site is crawled again, and even then it may take a while longer for the pages to actually be removed from the index/results.
If you use Webmaster Tools, are they showing up in your "Crawler access" screen (under Site Configuration)? That'll at least give you an idea of when the robots.txt file was last fetched.
You can use Google Webmaster Tools (Site configuration -> Sitelinks) to demote links on your website. You can also use robots.txt as suggested by Ilmari Karonen, or configure .htaccess (or httpd.conf) to perform a 301 redirect.
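A minimal .htaccess sketch of such a redirect, assuming mod_rewrite is enabled; redirecting to the site root is just a placeholder here, and the right target depends on your site's structure:

```
# .htaccess sketch (assumes mod_rewrite; the redirect target is hypothetical)
RewriteEngine On
# Send any /comment/reply/... request to the homepage with a permanent redirect
RewriteRule ^comment/reply/ / [R=301,L]
```

Keep in mind a 301 removes the pages for visitors too, so only do this if the reply URLs serve no purpose on their own.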
Have you made sure that your robots.txt syntax is correct? If you've signed up for Google's Webmaster Tools, you can use their robots.txt testing tool to see how Googlebot interprets it, but there are also several third-party robots.txt syntax checkers on the web.
You can also add robots meta tags to your reply pages to stop search engines from indexing them. One reason to do this, even if you have the pages disallowed in robots.txt, is that not all bots necessarily understand the fancier robots.txt syntax extensions such as * wildcards, or at least may not understand them the same way.
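A minimal version of such a tag, placed in the head of each reply page, might look like:

```
<!-- Tell compliant crawlers not to index this page, but still follow its links -->
<meta name="robots" content="noindex, follow">
```

As with the X-Robots-Tag header, the page must remain crawlable (not disallowed in robots.txt) for search engines to see the tag.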