Bomshteyn Consulting

Google Crawl Limit Checker

Check if Your HTML Content Exceeds Google's 2MB Limit

According to Google's documentation, Googlebot crawls only the first 2MB of an HTML file or other supported text-based file. Content beyond that cutoff is ignored and won't be indexed. Use this free tool to check whether your pages are within the limit.

Why Does This Matter?

  1. Content May Be Invisible: If your HTML exceeds 2MB, Googlebot stops fetching at the cutoff, and any content after that point is not indexed. Important text, links, or structured data near the bottom of a large page may never appear in search results.
  2. SEO Impact: Pages with bloated HTML (inline styles, large inline scripts, or large DOM trees) risk losing valuable content from Google's index, which can hurt rankings.
  3. Resource Limits Apply Separately: Each resource referenced in your HTML (CSS, JavaScript) is fetched separately and each is subject to the same 2MB limit.
  4. PDF Files Have a Higher Limit: Google is more generous with PDFs, crawling the first 64MB.
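
The checks described above can be sketched in a few lines of Python. This is only an illustration, not how this tool or Googlebot is implemented. The names check_crawl_limit, is_before_cutoff, and LimitReport are hypothetical, and the sketch assumes that "2MB" and "64MB" mean 2 × 1024 × 1024 and 64 × 1024 × 1024 bytes and that body holds the already-downloaded, decoded bytes of a single resource.

```python
from dataclasses import dataclass

# Assumption: "2MB" and "64MB" are read as binary megabytes (1 MB = 1024 * 1024 bytes).
MB = 1024 * 1024
HTML_LIMIT_BYTES = 2 * MB   # HTML and other supported text-based files
PDF_LIMIT_BYTES = 64 * MB   # PDF files


@dataclass
class LimitReport:
    size_bytes: int         # total size of the resource body
    limit_bytes: int        # limit that applies to this resource type
    within_limit: bool      # True if the whole body fits inside the limit
    bytes_past_cutoff: int  # number of bytes that fall after the cutoff


def check_crawl_limit(body: bytes, is_pdf: bool = False) -> LimitReport:
    """Compare the size of one fetched resource with the limit for its type."""
    limit = PDF_LIMIT_BYTES if is_pdf else HTML_LIMIT_BYTES
    size = len(body)
    return LimitReport(size, limit, size <= limit, max(0, size - limit))


def is_before_cutoff(body: bytes, marker: bytes, limit: int = HTML_LIMIT_BYTES) -> bool:
    """Return True if `marker` occurs entirely within the first `limit` bytes."""
    return marker in body[:limit]
```

For example, calling check_crawl_limit(html_bytes) on a page body reports how many bytes fall past the cutoff, and is_before_cutoff(html_bytes, b'application/ld+json') gives a rough signal of whether a structured-data block sits inside the crawled portion. Because each CSS or JavaScript file is fetched separately, the same check would be run once per resource rather than on the combined total.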

Source: Google Search Central Documentation