Blog / Content Site
Optimized for long-form content with comprehensive SEO checks.- Focuses on content quality (word count, TOC, citations)
- Longer titles and descriptions for blog posts
- Excludes admin areas and media files
- Preserves pagination parameters
- Validates external citations
E-commerce Site
Optimized for product pages and conversion paths.- Focuses on product and category pages
- Excludes cart/checkout (dynamic content)
- Preserves filter and sort parameters
- Allows shorter content for products
- Excludes conversion pages from dead-end check
- Longer cache for external links (products stable)
Documentation Site
Optimized for technical documentation and cross-referencing.- Focuses on documentation sections
- Excludes images from crawl
- Shorter content acceptable (reference material)
- Requires TOC for most pages
- Expects heavy internal linking
- Allows technical term repetition
- Short external link cache (docs update often)
Marketing Landing Pages
Optimized for conversion-focused single pages.- Small page count (focused audits)
- Disables TOC, citations (not relevant)
- Disables orphan/dead-end checks (landing pages isolated)
- Very short content acceptable
- Minimal internal links (CTA focused)
- No external link checking (minimal outbound)
- Focuses on core SEO, mobile, performance
News / Magazine Site
Optimized for frequently updated content.- Excludes fast-changing breaking news
- Short external link cache (news changes)
- Requires citations (journalism)
- Expects longer content
- Strict performance requirements
- Low prefix budget (balanced coverage)
SaaS Marketing Site
Optimized for product marketing and conversion.- Excludes logged-in app areas
- Focuses on marketing pages
- Shorter content acceptable (marketing)
- Pricing/signup pages are intentional dead-ends
- Disables irrelevant categories (local, video)
Local Business Site
Optimized for local SEO and service pages.- Small site (100 pages typical)
- Focuses on local SEO rules
- Longer title to include location
- Shorter content acceptable
- Long external link cache (info stable)
- Disables citation requirements
CI/CD Pipeline
Fast, focused checks for automated testing.- Fast crawl (no delays)
- Only critical checks
- No external link validation
- JSON output for parsing
- Lenient thresholds (just presence checks)
- Ignores robots.txt (testing environment)
Multi-Domain Project
Crawl main site and subdomain together.example.comwww.example.comblog.example.comdocs.example.comshop.example.com
domains = ["example.com"]allows all subdomains- Higher page limit for multiple domains
- Per-host concurrency prevents overwhelming single subdomain
- Balanced prefix budget for diverse coverage
High-Volume Crawl
Large site with thousands of pages.- High page limit
- Aggressive concurrency
- Low prefix budget (balanced coverage)
- Disables slow rules
- Fast external link timeout
- JSON output for large datasets
Local Development
Fast crawling for localhost testing.- No delays (localhost is fast)
- Disables security checks (localhost is HTTP)
- No external link checking
- Standard requests (faster)
- Console output for quick feedback
Accessibility Audit
Focus on WCAG compliance.- Only accessibility and mobile rules
- Includes image alt text
- HTML report for visual review
- Moderate page count
Performance Audit
Focus on Core Web Vitals hints.- Only performance rules
- Strict TTFB thresholds
- Strict DOM size limits
- HTML report for visualization
Security Audit
Focus on HTTPS, headers, and security.- Only security rules
- Validates external HTTPS
- JSON for automated security testing
- Moderate page count
Complete Production Site
Comprehensive audit for production deployment.- Polite crawling (production site)
- All rules enabled (comprehensive)
- External link validation
- Quality content thresholds
- HTML report for sharing
- Respects robots.txt
- Browser impersonation for reliability
Related
- Configuration Overview - Understanding config system
- Crawler Settings - Crawl behavior
- Rules Configuration - Rule patterns
- Rule Options - Per-rule config