Paste your XML sitemap and instantly validate structure, URLs, required tags, and duplicates. Free, no sign-up required.
Open your sitemap URL (usually /sitemap.xml), copy all content, and paste it here
Other free tools to help you audit and optimize your site.
Paste your content and check keyword density instantly. Optimize without over-stuffing.
Use Tool \u2192Analyze your content readability with Flesch Reading Ease and grade-level scores.
Use Tool \u2192Check if your page title fits Google SERP limits and preview how it looks in search results.
Use Tool \u2192Generate compelling meta descriptions that improve click-through rates from search results.
Use Tool \u2192Build a valid robots.txt file to control how search engines crawl your website.
Use Tool \u2192Validate your Open Graph tags and preview how your pages look when shared on social media.
Use Tool \u2192Generate JSON-LD structured data for articles, products, FAQs, and more.
Use Tool \u2192Paste your HTML and analyze internal link structure, anchor text, and linking patterns.
Use Tool \u2192Count words, characters, sentences, and reading time for any piece of content.
Use Tool \u2192Compare two blocks of text to detect duplicate or near-duplicate content before publishing.
Use Tool \u2192Generate 5 SEO-optimized title tag variations from your current title and primary keyword.
Use Tool \u2192Compare your page URL and canonical URL to detect mismatches and canonicalization issues.
Use Tool \u2192Extract and categorize all anchor text from your HTML to identify over-optimization risks.
Use Tool \u2192How It Works
No account needed, no sign-up required. Completely free. Paste your sitemap XML to instantly see your URL count, detected errors, and a full list of issues to fix.
Navigate to your sitemap in a browser. Most sites use /sitemap.xml. You can also find it referenced in your robots.txt file under the Sitemap: directive. Copy all the XML content from the page.
Paste the raw XML content into the text area above. The validator handles standard sitemaps and sitemap index files. No need to format or clean the XML first.
Instantly see your URL count, error count, warning count, and a detailed list of every issue with a clear explanation of what to fix. Then resubmit a corrected sitemap to Google Search Console.
Validation Rules
This tool runs eight validation checks against the official Sitemap Protocol specification and Google Webmaster Guidelines to catch the most impactful issues.
Sitemap Health Score
0 errors = Valid | 1-3 issues = Fair | 4+ issues = Invalid
Example: 1 missing namespace + 2 invalid URLs = Fair. Fix errors first, warnings second.
The validator checks your XML against the Sitemap Protocol v0.9 specification published by sitemaps.org and cross-referenced with Google Search Console requirements. Errors are issues that will cause Google to reject or ignore your sitemap. Warnings are issues that reduce sitemap effectiveness. Info items are optional improvements.
A sitemap with even one error-level issue may be completely ignored by Google Search Console. Fix all errors before submitting, then address warnings to maximize crawl efficiency. Info-level suggestions are optional but recommended for large or frequently updated sites.
Validation Checks
Use this reference to understand what each check means and how to fix issues at each severity level before resubmitting your sitemap.
| Validation Check | Severity | Description |
|---|---|---|
| Valid XML structure | Error | The file must be well-formed XML that parsers can read without errors |
| Correct namespace declaration | Error | Must include xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" |
| Presence of <loc> tags | Error | Every <url> entry must have a <loc> element with an absolute URL |
| Absolute URLs (http/https) | Error | All loc values must start with http:// or https:// |
| Duplicate URL detection | Warning | Each URL should appear exactly once in the sitemap |
| <lastmod> presence | Warning | Last-modified dates help Google prioritize crawling of recently updated pages |
| <changefreq> presence | Info | Optional hint to search engines about expected update frequency |
| <priority> presence | Info | Optional value from 0.0 to 1.0 indicating relative importance of a URL |
Based on Sitemap Protocol v0.9 and Google Search Console requirements, 2026.
Sitemap Types
Different content types and site sizes require different sitemap formats. Use this guide to choose the right structure for your site architecture.
| Sitemap Type | Root Element | Best For | Notes |
|---|---|---|---|
| Standard URL sitemap | <urlset> | Any site with up to 50,000 pages | Most common type. Lists individual page URLs with optional metadata. |
| Sitemap Index | <sitemapindex> | Sites with 50,000+ pages or multiple sections | References multiple child sitemaps. Acts as a master index. |
| Image sitemap | <image:image> | Sites with indexable image content | Extends standard sitemap with image metadata for Google Image Search. |
| Video sitemap | <video:video> | Sites hosting video content | Provides video metadata to help Google index video pages correctly. |
| News sitemap | <news:news> | Google News publishers | Required for Google News inclusion. Must include articles from the past 2 days. |
Sitemap types per sitemaps.org protocol and Google documentation, 2026.
What Breaks Your Sitemap
These mistakes are surprisingly common, even on established sites. Each one reduces your sitemap effectiveness and can leave important pages out of search results.
Your sitemap should only include canonical, indexable URLs. Including redirected URLs, paginated pages, noindex pages, or duplicate content forces Google to process URLs that add no value. Every URL in your sitemap signals to Google that it is important enough to index.
Only include canonical, indexable pagesWithout lastmod tags, Google cannot tell which pages have been recently updated and may deprioritize crawling. Accurate lastmod dates help search engines focus their crawl budget on fresh content. Inaccurate dates (always set to today) undermine this signal and can be penalized.
Include accurate lastmod dates for all URLsGoogle caps each sitemap file at 50,000 URLs and 50MB uncompressed. Oversized sitemaps may be partially processed or ignored. Use a sitemap index file to split large sitemaps into manageable files organized by content type or date range.
Use sitemap index files for sites over 50,000 pagesAll <loc> values must be absolute URLs starting with http:// or https://. Relative paths like /about or //example.com/page are invalid and will cause the sitemap to be rejected or ignored by search engine crawlers.
Always use full absolute URLs in <loc> tagsA sitemap that does not reflect your current site structure misleads crawlers. When you add new pages, remove old ones, or reorganize your site, update and resubmit your sitemap immediately. Stale sitemaps reduce crawl efficiency and delay indexing of new content.
Regenerate your sitemap whenever site structure changesA valid sitemap that is not submitted to Google Search Console and Bing Webmaster Tools offers limited SEO value. Submission triggers faster discovery, provides crawl error reporting, and lets you see which URLs have been indexed. Both tools are free to use.
Submit to Google Search Console and Bing Webmaster ToolsSitemap Best Practices
Use these strategies to ensure every important page on your site gets discovered, crawled, and indexed. All CommonNinja widgets mentioned are free to start.
Before adding a new page URL to your sitemap, ensure it has enough content to justify indexing. Accordion-organized FAQ sections add substantial word count and keyword coverage that make pages worth crawling.
Try Accordion widget →Pages with tabs containing specifications, reviews, and FAQs have more unique content than single-section pages. Include these richer pages in your sitemap to maximize the indexable content per URL.
Try Tabs widget →Comparison table pages tend to rank for high-intent commercial keywords. Add these pages to your sitemap with higher priority values to signal their importance to search engines.
Try Comparison Tables widget →Dynamic content feeds create new indexable pages regularly. Add a mechanism to automatically update your sitemap when new feed content is published so crawlers discover fresh URLs faster.
Try Feeds widget →Add a Sitemap: directive to your robots.txt file so any crawler, not just Google, can discover your sitemap automatically. The format is: Sitemap: https://example.com/sitemap.xml on its own line.
Sitemaps can be gzip compressed to reduce file size. Google and Bing both support .xml.gz format. Compressed sitemaps load faster, which is particularly important for sitemaps with tens of thousands of URLs.
After submitting your sitemap, check the Coverage report in Google Search Console regularly. It shows which URLs are indexed, which are excluded, and which have errors. This is your direct feedback loop for sitemap effectiveness.
If your site serves multiple languages or regions, use hreflang annotations in your sitemap to tell Google which pages target which audiences. This prevents duplicate content issues and ensures users see the right language version in search results.
Technical SEO Glossary
Understanding these concepts helps you make better decisions about how to structure your sitemap and communicate page priorities to search engines.
| Term | Definition | Format / Formula | When to Use |
|---|---|---|---|
| XML Sitemap | A structured file that lists URLs on your website along with optional metadata. Submitted to search engines to guide crawling and indexing priorities. | Sitemap Protocol v0.9 | Every website with more than a handful of pages. Required for efficient crawl coverage. |
| Crawl Budget | The number of pages a search engine bot will crawl on your site within a given timeframe. Sitemaps help direct crawl budget to your most valuable pages. | Crawl Rate x Crawl Demand | Large sites with thousands of pages where not all pages get crawled in each cycle. |
| Sitemap Index | A master sitemap file that lists references to multiple individual sitemap files. Used when a site exceeds 50,000 URLs or when organizing sitemaps by content type. | <sitemapindex> root element | Sites with more than 50,000 URLs or multiple distinct content categories. |
| lastmod | An optional XML tag that indicates the date a URL was last modified. Helps search engines prioritize crawling recently updated pages over stale ones. | YYYY-MM-DD format (ISO 8601) | Whenever you update a page. Use actual modification dates, not today's date on all pages. |
| changefreq | An optional hint to search engines about how often a page is expected to change. Values: always, hourly, daily, weekly, monthly, yearly, never. | Qualitative string value | Providing crawl frequency hints for content that updates on predictable schedules. |