Stay organized with collectionsSave and categorize content based on your preferences.
February 07, 2006
The new features released yesterday include a list of common words in your site's content and in
external links to your site. In some cases, these common words may not match what you expect from
your current content. The common words are calculated based on the results of the Googlebot
crawler. This can affect the data in a number of ways:
Googlebot hasn't crawled all pages on your site.If words on particular pages
are missing, make sure that those pages are being successfully crawled. If those pages are not
yet in your sitemap, adding them will help guide Googlebot to those portions of your site. Also,
make sure that you link to the pages of your site from within your site (for instance, by using
an HTML site map).
Your site has changed since we last crawled it.If you have just redesigned
your site, made significant content changes to an existing site, or purchased an existing domain
and changed the contents, the data will not be updated until Googlebot has successfully crawled
the new or changed pages.
Googlebot is unable to re-crawl modified pages.Googlebot may be unable to
access your server due to network error, or may encounter a server error when trying to load
your pages. Make sure that your server is responding properly to incoming requests.
Your site is not being crawled.New sites may take some time to be fully
crawled. Read through ourinformation for webmastersfor more
information about our crawling processes. Also, make sure your site doesn't violate theWebmaster Guidelines.
If you are seeing unexpected data and none of these apply to you, please let us know by posting
in ourGoogle Group.
We always appreciate user feedback and are working to improve Google Sitemaps.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],[],[[["\u003cp\u003eGoogle Sitemaps now shows common words found on your site and in external links, which might not always align with your expectations.\u003c/p\u003e\n"],["\u003cp\u003eThis data is based on the Googlebot crawler and can be affected by factors like incomplete crawling, recent site changes, crawling errors, and new site indexing delays.\u003c/p\u003e\n"],["\u003cp\u003eEnsure all your pages are crawlable, linked internally, and adhere to Webmaster Guidelines for optimal data accuracy.\u003c/p\u003e\n"],["\u003cp\u003eIf you encounter unexpected data unrelated to these factors, report it to the Google Group for feedback and improvements to Google Sitemaps.\u003c/p\u003e\n"]]],["Googlebot's crawler data determines the common words listed for a site, which may be inaccurate. This can result from Googlebot not crawling all site pages, recent site changes, inability to re-crawl modified pages due to server issues, or the site being new. To improve accuracy, ensure all pages are crawled, included in the sitemap, linked internally, and that the server responds to requests. Report any discrepancies via the Google Group.\n"],null,["# Unexpected Common Words\n\n| It's been a while since we published this blog post. Some of the information may be outdated (for example, some images may be missing, and some links may not work anymore).\n\nFebruary 07, 2006\n\n\nThe new features released yesterday include a list of common words in your site's content and in\nexternal links to your site. In some cases, these common words may not match what you expect from\nyour current content. The common words are calculated based on the results of the Googlebot\ncrawler. This can affect the data in a number of ways:\n\n- **Googlebot hasn't crawled all pages on your site.** If words on particular pages are missing, make sure that those pages are being successfully crawled. If those pages are not yet in your sitemap, adding them will help guide Googlebot to those portions of your site. Also, make sure that you link to the pages of your site from within your site (for instance, by using an HTML site map).\n- **Your site has changed since we last crawled it.** If you have just redesigned your site, made significant content changes to an existing site, or purchased an existing domain and changed the contents, the data will not be updated until Googlebot has successfully crawled the new or changed pages.\n- **Googlebot is unable to re-crawl modified pages.** Googlebot may be unable to access your server due to network error, or may encounter a server error when trying to load your pages. Make sure that your server is responding properly to incoming requests.\n- **Your site is not being crawled.** New sites may take some time to be fully crawled. Read through our [information for webmasters](/search/docs/fundamentals/how-search-works) for more information about our crawling processes. Also, make sure your site doesn't violate the [Webmaster Guidelines](/search/docs/essentials).\n\n\nIf you are seeing unexpected data and none of these apply to you, please let us know by posting\nin our\n[Google Group](https://support.google.com/webmasters/community).\nWe always appreciate user feedback and are working to improve Google Sitemaps.\n\nPosted by Andrey Stroilov, Google Engineering"]]