Skip to main content
Checks HTML document size against Googlebot crawl limits
Rule IDcrawl/html-size
CategoryCrawlability
ScopePer-page
Severityerror
Weight5/10

Solution

Googlebot truncates HTML documents at 2MB—content beyond that limit is silently ignored during indexing. Move inline styles and scripts to external files, defer non-critical content, lazy-load below-the-fold sections, and remove unnecessary markup. Keep critical SEO content (title, meta, headings, main body) near the top of the document so it’s indexed even if truncation occurs.

Options

This rule supports the following configuration options:
OptionTypeDefaultDescription
warn_bytesunknownundefinedByte size to trigger warning
error_bytesunknownundefinedByte size to trigger error (Googlebot truncation limit)

Configuration Example

squirrel.toml
[rules."crawl/html-size"]
warn_bytes = undefined
error_bytes = undefined

Enable / Disable

Disable this rule

squirrel.toml
[rules]
disable = ["crawl/html-size"]

Disable all Crawlability rules

squirrel.toml
[rules]
disable = ["crawl/*"]

Enable only this rule

squirrel.toml
[rules]
enable = ["crawl/html-size"]
disable = ["*"]