: Does specifying full webpage path in robots.txt would affect my website? It may be a silly question but I need to clarify my doubt because it's related to robots.txt. I need to prevent some

It may be a silly question but I need to clarify my doubt because it's related to robots.txt.

I need to prevent some label path of my website using robots.txt file.

User-agent: *
Disallow: /directory/wp-admin/path/label

If I give like this, would it affect all the paths separately?

Which means, does Googlebot consider directory, wp-admin, path and label separately?

10.02% popularity Vote Up Vote Down

: What does "Disallow: /search" mean in robots.txt? In my blog's Google Webmaster Tools panel, I found the following code in my robots.txt of blocked URLs section. User-agent: Mediapartners-Google

@Steve110

Posted in: #GoogleSearchConsole #RobotsTxt #SearchEngines #WebCrawlers

5 Comments

: Rel="next" in anchor tag is not working I have 100's of duplicate meta description issue in google webmaster tool because of the pagination. Pagination pages are showing as duplicate in tool.

@Steve110

Posted in: #GoogleSearchConsole #Pagination #Rel

2 Comments

: Should "other interest" outbound link be nofollow? I am usually blogging about programming issues in my blog. Occasionally I like to write guest post in the "travel" category. I like to link

@Steve110

Posted in: #Blog #Nofollow

2 Comments

: A date format that isn't picked up by Google as keyword My site is lacking a bit of written content through reasons I cannot control, but it does feature a lot of dates. Currently my top

@Steve110

Posted in: #DateFormat #Seo

1 Comments

Login to post a comment!

2 Comments

Sorted by latest first Latest Oldest Best

@Ann8826881

The path specified in the Disallow: field is simply a URL prefix. So, any URL that starts with this prefix will be blocked.

Disallow: /directory/wp-admin/path/label

From your example, this will therefore block all of the following URLs:

/directory/wp-admin/path/label
/directory/wp-admin/path/labelfoo
/directory/wp-admin/path/label/
/directory/wp-admin/path/label/bar.html

But will not block:

/directory/wp-admin/path/foo
/directory/wp-admin/path/
/directory/wp-admin/hello.html
:

Googlebot does not see the separate directories that make up the path. It is simply one value, one URL prefix.

More information on the Google Developers website: developers.google.com/webmasters/control-crawl-index/docs/robots_txt

10% popularity Vote Up Vote Down

@Jessie594

Robots.txt treats each path directive individually...

So, for example:-

User-agent:*
Disallow: /directory/wp-admin/path/label

This would disallow crawling of the directory label but not everything preceeding this directory nor following it.

User-agent:*
Disallow: /directory/wp-admin/path/label/*

This would disallow the crawling of everything within the directory label , including sub directories and their contents and the directory itself.

10% popularity Vote Up Vote Down

Feed

: Does specifying full webpage path in robots.txt would affect my website? It may be a silly question but I need to clarify my doubt because it's related to robots.txt. I need to prevent some

More posts by @Steve110

: What does "Disallow: /search" mean in robots.txt? In my blog's Google Webmaster Tools panel, I found the following code in my robots.txt of blocked URLs section. User-agent: Mediapartners-Google

: Rel="next" in anchor tag is not working I have 100's of duplicate meta description issue in google webmaster tool because of the pagination. Pagination pages are showing as duplicate in tool.

: Should "other interest" outbound link be nofollow? I am usually blogging about programming issues in my blog. Occasionally I like to write guest post in the "travel" category. I like to link

: A date format that isn't picked up by Google as keyword My site is lacking a bit of written content through reasons I cannot control, but it does feature a lot of dates. Currently my top

Login to post a comment!

2 Comments

Back to top | Use Dark Theme