Mobile app version of vmapp.org
Login or Join
Murray155

: Would a 403 page, a blank page and a page redirect be indexed by search robots? I am wondering if a disallow directive needs to be issued to robots on pages that will generate: a 403 forbidden

@Murray155

Posted in: #403Forbidden #Googlebot #Redirects #RobotsTxt #Seo

I am wondering if a disallow directive needs to be issued to robots on pages that will generate:


a 403 forbidden error;
a blank page, or;
a page redirect (to log in page for example).


Would not doing so have any impact on SEO of the site?

10.02% popularity Vote Up Vote Down


Login to follow query

More posts by @Murray155

2 Comments

Sorted by latest first Latest Oldest Best

 

@Samaraweera270

Will they be indexed:


403 Error – no, most search engines don't index error pages (assuming they're served with the correct HTTP status code)
Blank page – almost certainly not, assuming it is entirely blank (no <title>, nothing).
Page redirect – Generally only the destination of the redirect will be indexed (usually this is the intended result), but it depends on the type of redirect. For 301 the redirecting page will be de-indexed in favour of the destination page, however for 302 often both the referring and destination page can continue to be indexed.


SEO impact

From the above, you should be able to see that assuming everything is configured correctly, then in cases (1) and (3) we don't need to do anything else to manage these pages' SEO impact.

In the case of (2), we probably don't need to do anything else for a genuinely blank page, though I'd recommend avoiding generating blank pages at all if possible. If not, Disallow is better than nothing, and if you can apply "noindex" directly to the page either via HTML or HTTP header, better still.

10% popularity Vote Up Vote Down


 

@Radia820

You should block these directories in the robots, not so much for ranking purposes because it doesn't matter Google often finds lots of 403 and its not irregular for Google to find them. But it'll clutter your Web Master Tools so it is best to block them.

Block Registered User Areas with the Following:

Robots.txt

User-agent: *
Disallow: /user-area-here-change-me/


Also use noindex as Google recommends both: On all registered area pages use:

<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">

10% popularity Vote Up Vote Down


Back to top | Use Dark Theme