: Should I include everything in the sitemap or only new content? For a website with dynamic content (new content is constantly being added), should I only include the newest content in the sitemap
For a website with dynamic content (new content is constantly being added), should I only include the newest content in the sitemap or should I include everything (with a sitemap index)? What are the best practices for sitemaps esp. for large sites?
Also, is there anyway to make google (and other search engines) only crawl the pages in the sitemap?
Thanks
Update:
Also, any idea how stackoverflow handle this? I'd like to know but unfortunately (also understandingly) they have blocked access to their sitemap.
More posts by @Steve110
1 Comments
Sorted by latest first Latest Oldest Best
Include all pages. The purpose of the XML sitemap is to tell the search engines about all of your content. Not just the new stuff.
From the sitemaps.org website (emphasis mine):
Sitemaps are an easy way for
webmasters to inform search engines
about pages on their sites that are
available for crawling.
If you have a lot of content you can use multiple XML sitemaps.
If you have content that you don't want to have crawled or indexed you need to specifically tell the search engines not to crawl and index those pages. Use a robots.txt file to block any pages or directories that you do not wish to have crawled. You can also use a meta tag for that as well. But you cannot specify in an XML sitemap not to crawl unlisted pages.
Terms of Use Create Support ticket Your support tickets Stock Market News! © vmapp.org2025 All Rights reserved.