High-performance SEO analysis CLI with 220+ checks, Rust-powered duplicate detection, worker thread parallelization, and clean progress UI. Analyzes 1000+ page sites in under 2 minutes. Achieves ~85% Screaming Frog parity.
```bash
npm install seo-reporter
```

A powerful TypeScript CLI tool that crawls websites, extracts SEO metadata, detects common issues, and generates comprehensive HTML reports. Think of it as a free, open-source alternative to Screaming Frog's core SEO analysis features.

```bash
# Install dependencies
pnpm install
```
## Features
### Core Capabilities
- 🕷️ Website Crawling: Breadth-first traversal with configurable depth and concurrency
- 🔍 220+ SEO Checks: Implements 85-90% parity with Screaming Frog's core analysis
- ⚡ Fast & Efficient: Concurrent crawling with configurable request limits and memory-efficient processing
- 📊 Interactive HTML Reports: Beautiful, filterable reports with severity-based issue categorization (now with working filters, sortable columns, default severity sort, live filtered totals, a new Issues-by-Type view with drill-down, and a reorganized nav with a Content dropdown)
- 💡 Actionable Tooltips: Hover over any issue to see specific fix recommendations (25+ guidance tips). Improved (i) tooltips now render consistently across the report.
- 📱 Mobile Responsive: All reports fully responsive with touch-friendly interfaces (44px tap targets)
- 🖥️ Built-in Server: Zero-dependency static file server using Node.js built-ins (`seo-reporter serve`)
- 🔌 JSON API Routes: Complete RESTful-style JSON API for all SEO data - perfect for integrations and custom dashboards
- 🧭 Clickable Site Structure: Navigate from the site structure tree directly to per-page details
- 📤 CSV Export: Export all data to Excel-compatible CSV files for further analysis
- 🤖 Robots.txt Integration: Automatic robots.txt parsing and compliance
- 🗺️ Sitemap Analysis: Auto-detects and analyzes XML sitemaps (runs by default)

### On-Page Data Extraction

- Comprehensive On-Page Data: Titles, descriptions, headings, canonical URLs, robots directives
- Social Media Tags: Open Graph, Twitter Cards
- Internationalization: hreflang attributes with validation
- Structured Data: JSON-LD & microdata extraction
- Links: Internal/external links with anchor text, nofollow detection
- Images: Alt text, dimensions tracking
- Content Metrics: Word count, text length, HTML size, content-to-code ratio
- Performance: Response times, redirect chains
- Scripts: External JavaScript detection with async/defer tracking

### Security Analysis

- Protocol Security: HTTP vs HTTPS detection
- Mixed Content: HTTP resources on HTTPS pages
- Insecure Forms: Form actions over HTTP
- Security Headers: HSTS, CSP, X-Frame-Options, X-Content-Type-Options, Referrer-Policy (see the sketch after this list)
- Protocol-Relative URLs: Detection of // URLs
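
As a rough illustration of what a header check like this involves, here is a minimal TypeScript sketch; the header list and issue shape are simplified stand-ins, not the analyzer's actual types or rules.

```ts
// Illustrative only - the analyzer's real issue types and rules differ.
interface HeaderIssue {
  header: string;
  severity: "medium" | "low";
  message: string;
}

const EXPECTED_SECURITY_HEADERS = [
  "strict-transport-security",
  "content-security-policy",
  "x-frame-options",
  "x-content-type-options",
  "referrer-policy",
];

function checkSecurityHeaders(headers: Record<string, string | undefined>): HeaderIssue[] {
  // Header names are case-insensitive, so normalize before looking them up.
  const present = new Set(Object.keys(headers).map((name) => name.toLowerCase()));
  return EXPECTED_SECURITY_HEADERS.filter((h) => !present.has(h)).map((h) => ({
    header: h,
    severity: "medium",
    message: `Missing security header: ${h}`,
  }));
}

// Example with an axios-style response.headers object:
console.log(checkSecurityHeaders({ "content-type": "text/html", "x-frame-options": "DENY" }));
```
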
### URL Quality Checks

- URL Issues: Multiple slashes, spaces, non-ASCII characters, uppercase letters
- URL Structure: Repetitive paths, overly long URLs (>2083 chars)
- Parameters: Query params, tracking params, internal search URLs
- Fragment URLs: Detection of fragment-only links

### Content Quality

- Duplicate Detection:
  - Exact duplicates (SHA-256 hash-based)
  - Near duplicates (>90% similarity using MinHash)
  - Duplicate H1/H2 across pages
- Content Analysis:
  - Lorem ipsum detection
  - Soft 404 detection
  - Readability metrics (Flesch-Kincaid Grade, Reading Ease, ARI) - see the sketch after this list
  - Poor readability warnings (>12th grade level)
  - Thin content detection (<300 words)
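
For reference, the Flesch-Kincaid Grade Level behind the readability warning is a standard formula; the TypeScript sketch below shows the calculation with a rough syllable heuristic (the tool's own implementation may count syllables differently).

```ts
// Rough Flesch-Kincaid Grade Level; the syllable counter is a heuristic, not exact.
function countSyllables(word: string): number {
  const w = word.toLowerCase().replace(/[^a-z]/g, "");
  if (w.length <= 3) return 1;
  const vowelGroups = w.replace(/e$/, "").match(/[aeiouy]+/g); // drop a trailing silent "e"
  return Math.max(1, vowelGroups ? vowelGroups.length : 1);
}

function fleschKincaidGrade(text: string): number {
  const sentences = Math.max(1, (text.match(/[.!?]+/g) ?? []).length);
  const words = text.split(/\s+/).filter(Boolean);
  const wordCount = Math.max(1, words.length);
  const syllables = words.reduce((sum, w) => sum + countSyllables(w), 0);
  // Standard formula: 0.39 * (words/sentences) + 11.8 * (syllables/words) - 15.59
  return 0.39 * (wordCount / sentences) + 11.8 * (syllables / wordCount) - 15.59;
}

// A grade above ~12 is what triggers the "poor readability" warning.
console.log(fleschKincaidGrade("The quick brown fox jumps over the lazy dog."));
```
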
### On-Page SEO Checks

- Titles: Missing, duplicate, too long/short (characters + pixel width), multiple tags, outside `<head>`, identical to H1
- Meta Descriptions: Missing, duplicate, too long/short (characters + pixel width), multiple tags, outside `<head>`
- Headings: Missing H1, multiple H1, broken hierarchy, overly long (>70 chars), empty headings
- Canonical: Multiple/conflicting tags, relative URLs, fragments, outside `<head>`, invalid attributes
- Robots: Conflicting directives, noindex/nofollow detection
- Indexability Tracking: Comprehensive analysis of why pages are/aren't indexable
  - Non-200 status codes (404, 500, etc.)
  - noindex in meta robots tag
  - noindex in X-Robots-Tag header
  - Canonical pointing to different URL
  - Detailed reasons shown in Issues tab and individual page reports
- Pagination: rel="next"/rel="prev" validation, multiple pagination links, sequence errors

### Stealth Mode (Anti-Detection)

- Anti-Detection Crawling: Bypass basic bot detection systems with realistic browser simulation
- User Agent Rotation: 20+ realistic user agents from Chrome, Firefox, Safari, Edge across Windows, macOS, and Linux
- Header Randomization: Dynamic browser headers with realistic patterns and Chrome sec-ch-ua headers
- Human-Like Timing: Intelligent delays (1-8 seconds) simulating quick, normal, and slow browsing patterns (see the sketch after this list)
- Proxy Support: Rotate through multiple proxy servers with automatic failover and validation
- Session Management: Maintain consistent headers across requests for realistic browsing simulation
- Custom Configuration: Define your own user agents, proxies, and timing patterns
- Seamless Integration: Works with all existing crawl modes and analysis features
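
A simplified TypeScript sketch of the user-agent rotation and human-like delay idea is shown below; the agent pool, delay ranges, and function names are illustrative, not the tool's internals.

```ts
// Simplified stealth-style pacing - agent list, ranges, and names are illustrative.
const USER_AGENTS = [
  "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/124.0 Safari/537.36",
  "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.4 Safari/605.1.15",
  "Mozilla/5.0 (X11; Linux x86_64; rv:125.0) Gecko/20100101 Firefox/125.0",
];

function randomUserAgent(): string {
  return USER_AGENTS[Math.floor(Math.random() * USER_AGENTS.length)];
}

// Wait a random, human-looking interval (defaults mirror the 1-8 second range described above).
function humanDelay(minMs = 1000, maxMs = 8000): Promise<void> {
  const ms = minMs + Math.random() * (maxMs - minMs);
  return new Promise((resolve) => setTimeout(resolve, ms));
}

async function fetchWithStealth(url: string): Promise<string> {
  await humanDelay();
  const res = await fetch(url, {
    headers: { "User-Agent": randomUserAgent(), Accept: "text/html" },
  });
  return res.text();
}
```
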
### Link Analysis & 404 Tracking

- 404 Tracking:
  - Dedicated 404 Pages tab with referrer tracking
  - Shows which pages link to each 404 (now normalized so /path and /path/ are treated the same)
  - Helps identify and fix broken internal links
- Link Quality:
  - Orphan pages (no internal inlinks)
  - Dead ends (no outlinks)
  - Weak anchor text ("click here", empty, too short)
  - Localhost links (127.0.0.1)
  - Missing protocol on external links (e.g., facebook.com without https://)
- Link Metrics:
  - Internal vs external link counts
  - High outlink warnings (>100 internal, >50 external)
  - Inlink count per page
  - Crawl depth distribution

### HTML Validation

- Document Structure: Missing/multiple `<head>` or `<body>` tags
- Element Positioning: Tags outside `<head>` that should be inside it
- Document Order: Incorrect `<head>`/`<body>` ordering
- Size & Complexity: Large HTML (>1MB), excessive DOM depth (>30 levels)
- Invalid Elements: Elements that shouldn't be in `<head>`

### Issue Severity Levels

- 🔴 High Severity: Missing titles/H1s, HTTP pages, mixed content, insecure forms, soft 404s, lorem ipsum, malformed HTML
- 🟡 Medium Severity: Title/description length issues, multiple H1s, images without alt, thin content, slow pages, missing security headers
- 🔵 Low Severity: Heading hierarchy, redirect chains, URL quality issues, readability warnings, informational notices

## Performance ⚡
SEO Reporter includes a Rust-powered native module for near-duplicate content detection, providing massive performance gains for large sites:
### Benchmarks
| Pages | TypeScript (O(n²)) | Rust + LSH (O(n)) | Speedup |
|-------|-------------------|-------------------|---------|
| 100 | ~10s | ~0.1s | 100x |
| 500 | ~2.5min | ~0.5s | 300x |
| 1000 | ~10min | ~1s | 600x |
| 5000 | ~4 hours | ~5s | ~3000x |
### How It Works
The Rust module uses Locality-Sensitive Hashing (LSH) with MinHash signatures:
- Generates 128-hash MinHash signatures for each page
- Groups pages into buckets using 16 bands × 8 rows
- Only compares pages that share at least one bucket (candidates)
- Reduces comparisons from O(n²) to O(n) with ~95% accuracy (see the sketch below)
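
The production path for this is the Rust module, but the banding idea can be sketched in TypeScript; the hashing below is deliberately simplistic and only meant to show how band keys turn pairwise comparisons into bucket lookups.

```ts
// Illustrative LSH banding over MinHash signatures - not the Rust module's actual code.
const NUM_HASHES = 128;
const BANDS = 16;                // 16 bands x 8 rows = 128 hash functions
const ROWS = NUM_HASHES / BANDS;

// Cheap seeded string hash, good enough for a demo.
function hashToken(token: string, seed: number): number {
  let h = seed;
  for (let i = 0; i < token.length; i++) {
    h = Math.imul(h ^ token.charCodeAt(i), 2654435761) >>> 0;
  }
  return h >>> 0;
}

// MinHash signature: for each of the 128 hash functions, keep the minimum token hash.
function minHashSignature(text: string): number[] {
  const tokens = text.toLowerCase().split(/\s+/).filter(Boolean);
  return Array.from({ length: NUM_HASHES }, (_, seed) =>
    tokens.reduce((min, t) => Math.min(min, hashToken(t, seed + 1)), Number.MAX_SAFE_INTEGER)
  );
}

// Pages whose signatures agree on every row of at least one band share a bucket
// and become candidate pairs; everything else is never compared.
function candidatePairs(pages: { url: string; text: string }[]): Array<[string, string]> {
  const buckets = new Map<string, string[]>();
  for (const page of pages) {
    const sig = minHashSignature(page.text);
    for (let b = 0; b < BANDS; b++) {
      const key = `${b}:${sig.slice(b * ROWS, (b + 1) * ROWS).join(",")}`;
      const bucket = buckets.get(key) ?? [];
      bucket.push(page.url);
      buckets.set(key, bucket);
    }
  }
  const seen = new Set<string>();
  const pairs: Array<[string, string]> = [];
  for (const urls of buckets.values()) {
    for (let i = 0; i < urls.length; i++) {
      for (let j = i + 1; j < urls.length; j++) {
        const id = `${urls[i]}|${urls[j]}`;
        if (!seen.has(id)) {
          seen.add(id);
          pairs.push([urls[i], urls[j]]);
        }
      }
    }
  }
  return pairs;
}
```
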
### Automatic Fallback
If the Rust module fails to load (unsupported platform or not built), the tool automatically falls back to the pure TypeScript implementation, ensuring compatibility on all platforms.
⚠️ Rust Warning: When the Rust module is unavailable, the CLI displays a clear warning:

```
⚠️  Rust native module not available - using TypeScript fallback for near-duplicate detection
    Note: Near-duplicate detection will be slower. Run npm rebuild to build the Rust module.
```

This helps users understand why near-duplicate detection may be slower and provides actionable instructions to enable the faster implementation.
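
A fallback like this is typically wired as a guarded require of the native binding. The sketch below uses a placeholder module path and a trivial stand-in fallback, so treat it as a pattern rather than the package's real loader.

```ts
// Pattern sketch: prefer the native binding, fall back to TypeScript. Paths and names are placeholders.
import { createRequire } from "node:module";

type NearDuplicateFn = (docs: string[]) => Array<[number, number]>;

function loadNearDuplicateDetector(): NearDuplicateFn {
  const require = createRequire(import.meta.url);
  try {
    // Hypothetical binding path - the real package resolves a platform-specific .node file.
    const native = require("./native/seo-reporter-rust.node");
    return native.findNearDuplicates as NearDuplicateFn;
  } catch {
    console.warn(
      "⚠️  Rust native module not available - using TypeScript fallback for near-duplicate detection"
    );
    // Stand-in O(n²) fallback: shape only, real similarity scoring omitted.
    return (docs) => {
      const pairs: Array<[number, number]> = [];
      for (let i = 0; i < docs.length; i++) {
        for (let j = i + 1; j < docs.length; j++) {
          if (docs[i] === docs[j]) pairs.push([i, j]);
        }
      }
      return pairs;
    };
  }
}
```
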
### Supported Platforms
Pre-built Rust binaries are included for:
- macOS (Intel x64 & Apple Silicon ARM64)
- Linux (x64 & ARM64, glibc & musl)
- Windows (x64)
## Installation
### Prerequisites
- Node.js 18 or higher
- pnpm (recommended) or npm
- Rust 1.70+ (optional, only needed if building from source; pre-built binaries included)
### Install Dependencies
```bash
pnpm install
```

### Optional: Install Rust
For maximum performance, install Rust to enable the native module:
```bash
# Automatic Rust installation (Windows, macOS, Linux)
pnpm setup:rust
```

This will:
- Download and install rustup (Rust toolchain installer)
- Install the latest stable Rust toolchain
- Set up the environment for building the native module
- Verify the installation
### Build the Project
```bash
pnpm build
```

### Building the Rust Module
The Rust module provides 100-1000x faster near-duplicate detection. The build process automatically handles Rust environment setup:
```bash
# Install Rust automatically (Windows, macOS, Linux)
pnpm setup:rust

# Full build (Rust + TypeScript)
pnpm build

# Rust module only
pnpm build:rust-only

# TypeScript only (fallback if Rust unavailable)
pnpm build:ts-only
```

Note: The build scripts automatically source the Rust environment (`$HOME/.cargo/env`) if available. Pre-built binaries are included for most platforms, but you can rebuild if needed.

### Running Locally
```bash
# Development mode (no build required)
pnpm dev --url https://example.com

# Production mode (requires build)
pnpm start --url https://example.com
```

### Global Installation
```bash
pnpm install -g .
seo-reporter --url https://example.com
```

## Usage

### Quick Start
```bash
# Crawl and analyze a website
seo-reporter crawl --url https://example.com

# Or use the legacy format (still supported)
seo-reporter --url https://example.com

# Start the report server (no URL needed)
seo-reporter serve
```

### Command-Line Options
```bash
# Crawl command
seo-reporter crawl \
  --url https://example.com \                     # Required: Target URL to crawl
  --depth 3 \                                     # Optional: Max crawl depth (default: 3)
  --max-pages 1000 \                              # Optional: Max pages to crawl (default: 1000)
  --concurrency 10 \                              # Optional: Concurrent requests (default: 10)
  --output ./seo-report \                         # Optional: Output directory (default: ./seo-report)
  --timeout 10000 \                               # Optional: Request timeout in ms (default: 10000)
  --user-agent "CustomBot/1.0" \                  # Optional: Custom user agent
  --export-csv \                                  # Optional: Export results to CSV files
  --respect-robots \                              # Respect robots.txt (default: true)
  --ignore-robots \                               # Ignore robots.txt rules
  --crawl-mode both \                             # Optional: Crawl mode - crawl|sitemap|both (default: both)
  --sitemap-url https://example.com/sitemap.xml \ # Custom sitemap URL
  --validate-schema \                             # Validate JSON-LD schema.org data
  --stealth \                                     # Enable stealth mode with randomized headers and timing
  --stealth-user-agents "Agent1,Agent2" \         # Custom user agents for stealth mode
  --stealth-min-delay 1000 \                      # Minimum delay between requests in stealth mode (ms)
  --stealth-max-delay 5000 \                      # Maximum delay between requests in stealth mode (ms)
  --stealth-proxies "proxy1:8080,proxy2:3128"     # Proxy rotation for stealth mode

# Serve command
seo-reporter serve \
  --port 8080 \    # Optional: Port to listen on (default: 8080)
  ./seo-report     # Optional: Directory to serve (default: ./seo-report)
```

### Crawl Modes
The `--crawl-mode` option controls how the tool discovers pages:

- `crawl` - Follow links only (traditional crawling)
- `sitemap` - Crawl only URLs found in sitemap(s)
- `both` - Crawl sitemap URLs + follow links (default, discovers maximum pages)

Example:
```bash
# Only crawl URLs from sitemap
seo-reporter --url https://example.com --crawl-mode sitemap

# Traditional link-based crawling only
seo-reporter --url https://example.com --crawl-mode crawl

# Both (default)
seo-reporter --url https://example.com --crawl-mode both
```

### Viewing Reports
After generating a report, you can view it in two ways:
Option 1: Open directly in browser

```bash
open seo-report/index.html
```

Option 2: Start a local server (recommended)

```bash
# Using the built-in server
seo-reporter serve seo-report

# Or with a custom port
seo-reporter serve seo-report --port 3000

# Using npm script
pnpm serve   # Serves ./seo-report on port 8080
```

The built-in server uses Node.js's native `http` module (zero dependencies, works everywhere).
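
To illustrate the zero-dependency approach, here is a minimal static server built on `node:http`; it is a sketch of the same idea, not the serve command's actual code (the MIME table and path handling are simplified).

```ts
// Minimal static file server in the spirit of `seo-reporter serve` - illustrative only.
import { createServer } from "node:http";
import { readFile } from "node:fs/promises";
import { extname, join, normalize } from "node:path";

const root = process.argv[2] ?? "./seo-report";
const port = Number(process.env.PORT ?? 8080);

const MIME: Record<string, string> = {
  ".html": "text/html",
  ".js": "text/javascript",
  ".json": "application/json",
  ".css": "text/css",
  ".csv": "text/csv",
};

createServer(async (req, res) => {
  // Map "/" to index.html and strip leading "../" so requests stay inside the report directory.
  const urlPath = decodeURIComponent((req.url ?? "/").split("?")[0]);
  const relative = normalize(urlPath).replace(/^([/\\]|\.\.[/\\])+/, "");
  const filePath = join(root, relative === "" ? "index.html" : relative);
  try {
    const body = await readFile(filePath);
    res.writeHead(200, { "Content-Type": MIME[extname(filePath)] ?? "application/octet-stream" });
    res.end(body);
  } catch {
    res.writeHead(404, { "Content-Type": "text/plain" });
    res.end("Not found");
  }
}).listen(port, () => console.log(`Serving ${root} at http://localhost:${port}`));
```
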
### Progress Output

When running the tool, you'll see detailed progress for each phase:
```bash
$ seo-reporter --url https://example.com --max-pages 100

🔍 SEO Reporter

Configuration:
  URL: https://example.com/
  Max Depth: 3
  Max Pages: 100
  Concurrency: 10
  Output: ./seo-report

⠹ Crawling website... 🟢 25/100 pages
⠸ Crawling website... 🟢 50/100 pages
⠼ Crawling website... 🟢 100/100 pages
✔ Crawled 100 pages in 15.2s

⠹ Parsing SEO metadata... 25/100 pages
⠸ Parsing SEO metadata... 50/100 pages
⠼ Parsing SEO metadata... 100/100 pages
✔ Parsed metadata from 100 pages in 3.4s

⠹ Analyzing... Per-page analysis (25/100)
⠸ Analyzing... Per-page analysis (50/100)
⠼ Analyzing... Per-page analysis (100/100)
⠴ Analyzing... Link quality analysis
⠦ Analyzing... Content quality checks
⠧ Analyzing... Finding duplicate titles/descriptions
⠇ Analyzing... Finding exact duplicate content
⠏ Analyzing... Finding near-duplicate content (Rust + LSH)
⠋ Analyzing... Finding duplicate headings
✔ Analysis complete in 2.1s
✔ Sitemap analyzed (95 URLs in sitemap)

📋 Issues Found:
  ⚠️ 5 pages with missing meta descriptions
  ⚠️ 3 pages with duplicate titles
  ...
```

Note: The progress counters (e.g., 25/100) show real-time progress during crawling, parsing, and analysis phases, making it easy to estimate remaining time.

### Example Commands
Crawl a small site with depth 2:
```bash
seo-reporter --url https://myblog.com --depth 2 --max-pages 100
```

Fast crawl with high concurrency:

```bash
seo-reporter --url https://example.com --concurrency 20 --depth 2
```

Crawl and save to custom directory:

```bash
seo-reporter --url https://example.com --output ./reports/example-audit
```

Crawl with CSV export:

```bash
seo-reporter --url https://example.com --export-csv
```

Stealth mode crawling:

```bash
# Basic stealth mode
seo-reporter --url https://example.com --stealth

# Stealth with custom timing
seo-reporter --url https://example.com --stealth --stealth-min-delay 2000 --stealth-max-delay 8000

# Stealth with custom user agents and proxies
seo-reporter --url https://example.com --stealth \
  --stealth-user-agents "Mozilla/5.0 (Custom Bot),Another Custom Agent" \
  --stealth-proxies "proxy1.example.com:8080,proxy2.example.com:3128"
```

## SEO Issues Detected
The tool checks for the following SEO issues:
### High Severity (Errors)
- ❌ Missing Title Tags: Pages without a `<title>` tag
- ❌ Broken Links: Pages returning 404 status codes
- ❌ Conflicting Robots Directives: Multiple robots tags with contradictory instructions (e.g., "index" and "noindex")
- ❌ Multiple Canonical Tags: Pages with conflicting canonical URLs
- ❌ Malformed JSON-LD: Structured data scripts with JSON parsing errors

### Medium Severity (Warnings)

- ⚠️ Missing Meta Descriptions: Pages without meta description tags
- ⚠️ Duplicate Titles: Multiple pages sharing the same title text
- ⚠️ Duplicate Descriptions: Multiple pages sharing the same meta description
- ⚠️ Title Too Long: Titles over 60 characters (may be truncated in search results)
- ⚠️ Title Too Short: Titles under 20 characters (may not be descriptive enough)
- ⚠️ Description Too Long: Meta descriptions over 160 characters (may be truncated)
- ⚠️ Description Too Short: Meta descriptions under 50 characters (may not be informative enough)
- ⚠️ Missing H1 Tags: Pages without an H1 heading
- ⚠️ Multiple H1 Tags: Pages with more than one H1 heading
- ⚠️ Improper Heading Hierarchy: Heading levels that skip numbers (e.g., H1 to H3)
- ⚠️ Images Without Alt Text: Images missing accessibility alt attributes
- ⚠️ Thin Content: Pages with less than 300 words
- ⚠️ Slow Page Load: Pages with response times over 3 seconds

### Low Severity (Informational)

- ℹ️ Noindex Pages: Pages set to noindex (verify if intentional)
- ℹ️ Redirect Chains: Pages with redirect chains detected
- ℹ️ Multiple Title/Description Tags: Single page with duplicate meta tags

### Additional Analysis

- All headings (H1-H6) extracted and analyzed for proper hierarchy
- Internal vs external link analysis
- Image alt text coverage
- Word count and content density metrics
- Open Graph and Twitter Card metadata presence
- hreflang implementation
- JSON-LD and microdata structured data detection

## Output Reports
The tool generates comprehensive reports in the specified output directory:
### HTML Reports
- index.html: Interactive summary report with:
  - Tabbed Interface: Overview, All Pages, Site Structure, Links, Content, Performance, Scripts, Sitemap, Issues, and API tabs
  - Sortable Tables: Click column headers to sort data ascending/descending
  - Filterable Content: Search boxes to quickly find specific pages, links, or issues
  - Visual Statistics: Color-coded cards showing issue counts and severity
  - All Data: Links analysis (internal/external), headings, images, performance metrics
- page-viewer.html: Dynamic page detail viewer that loads data from JSON files
  - Displays complete page metadata, issues, headings, links, and images
  - Loads data on-demand from JSON API routes
  - Accessed via `page-viewer.html?url=<page-url>`

Reports are fully self-contained with inline CSS and JavaScript - no external dependencies.
### JSON API Routes
All SEO data is available as JSON files for programmatic access, integrations, or custom dashboards:
#### Individual Page Data
- `json/pages/{filename}.json`: Complete page metadata including:
  - Title, meta description, canonical URL, robots directives
  - All headings (H1-H6), links, and images
  - Content metrics (word count, HTML size, readability scores)
  - Performance metrics (response time, redirects)
  - Security analysis (HTTPS, headers, mixed content)
  - URL quality metrics
  - Structured data (JSON-LD, microdata)
  - All detected issues with severity levels
- `json/issues/{filename}.json`: Page-specific issues with severity counts

#### Aggregate Data Endpoints
- `json/all-pages.json`: Summary of all pages with key metrics
- `json/all-issues.json`: All issues across all pages
- `json/issues-summary.json`: Issues statistics by severity and type
- `json/links.json`: All internal and external links
- `json/images.json`: All images with alt text status
- `json/headings.json`: All headings with levels
- `json/performance.json`: Performance metrics for all pages
- `json/external-scripts.json`: External JavaScript usage analysis
- `json/404-pages.json`: 404 pages with referrer tracking
- `json/sitemap-info.json`: Sitemap analysis data
- `json/site-structure.json`: Site structure tree
- `json/url-index.json`: URL to filename mapping for easy lookups

#### Using the JSON API
```bash
# Generate report
seo-reporter --url https://example.com

# Access JSON data programmatically
curl http://localhost:8000/seo-report/json/all-issues.json
curl http://localhost:8000/seo-report/json/pages/index.json
curl http://localhost:8000/seo-report/json/issues-summary.json
```

```js
// Or use in your application
fetch('./seo-report/json/all-pages.json')
  .then(res => res.json())
  .then(data => console.log(data.pages));
```

#### In-Report API Tab
Open the API tab in `index.html` for an at-a-glance list of endpoints, example curl/JS usage, and tips on mapping URLs to filenames via `json/url-index.json`. The tab now reliably renders with the updated tab switching logic.

The JSON API is perfect for:

- CI/CD pipeline integrations (see the sketch after this list)
- Custom dashboards and visualizations
- Automated monitoring and alerts
- Data analysis and reporting scripts
- Integration with other SEO tools
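
As an example of a CI gate, the script below fails a build when high-severity issues are present. It assumes `json/issues-summary.json` exposes per-severity counts, so check the generated file for the actual field names before relying on it.

```ts
// CI gate sketch: fail the build when the crawl reports high-severity issues.
// Assumes json/issues-summary.json exposes per-severity counts - verify the real field names.
import { readFile } from "node:fs/promises";

interface IssuesSummary {
  high?: number;
  medium?: number;
  low?: number;
}

const summaryPath = process.argv[2] ?? "./seo-report/json/issues-summary.json";
const summary: IssuesSummary = JSON.parse(await readFile(summaryPath, "utf8"));

const high = summary.high ?? 0;
console.log(`High: ${high}  Medium: ${summary.medium ?? 0}  Low: ${summary.low ?? 0}`);

if (high > 0) {
  console.error(`${high} high-severity SEO issues found - failing the build.`);
  process.exit(1);
}
```
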
#### Redirect Handling
- Same-domain redirects are followed and analyzed. After redirects within the same domain, links are resolved against the final URL to ensure correct internal/external classification (see the sketch below).
- Cross-domain redirects are not analyzed or crawled. The redirect chain is recorded for the original URL, but the destination page's content and links are not fetched or followed.
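
The key step is resolving extracted hrefs against the post-redirect URL; a tiny sketch with the WHATWG URL API (not the crawler's actual code) shows why this matters.

```ts
// Resolve extracted hrefs against the final (post-redirect) URL before classifying them.
function classifyLink(href: string, finalUrl: string, siteHost: string) {
  const resolved = new URL(href, finalUrl); // relative hrefs resolve against the final URL
  return {
    url: resolved.toString(),
    isInternal: resolved.hostname === siteHost,
  };
}

// A page requested at https://example.com/docs that redirected to https://example.com/docs/
// resolves "./setup" to https://example.com/docs/setup, not https://example.com/setup.
console.log(classifyLink("./setup", "https://example.com/docs/", "example.com"));
```
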
#### Large Site Performance (10k+ pages)
- For large datasets, reports now use chunked JSONP files and a small client runtime to progressively render big tables (see the sketch after this list).
- Tables support pagination, sorting, filtering, and a page-size selector (25/50/100/250/500).
- Data files are written to `seo-report/data/…/*.js` and work offline via `file://` (no fetch).
- Sorting or filtering may trigger background loading of remaining chunks for accuracy.
- Small sites still render inline immediately; large sites render almost instantly and stream in data.
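
The exact chunk format is internal to the report, but the JSONP pattern itself is simple: each data file is a `.js` script that calls a global callback, so it can be loaded with a `<script>` tag even from `file://`. A hedged TypeScript sketch of that pattern (the file name and callback name are made up):

```ts
// Illustration of the JSONP-style pattern only - not the report's actual chunk format.
// A chunk file such as data/pages-chunk-1.js might contain something like:
//   window.__seoReportChunk({ chunk: 1, rows: [/* row objects */] });

type ChunkPayload = { chunk: number; rows: unknown[] };

declare global {
  interface Window {
    __seoReportChunk?: (payload: ChunkPayload) => void;
  }
}

// Loading chunks one at a time keeps this sketch simple; the real client runtime is more involved.
function loadChunk(src: string): Promise<ChunkPayload> {
  return new Promise((resolve, reject) => {
    window.__seoReportChunk = (payload) => resolve(payload); // called by the chunk script itself
    const script = document.createElement("script");
    script.src = src;                                        // plain <script>, so file:// works (no fetch)
    script.onerror = () => reject(new Error(`Failed to load ${src}`));
    document.head.appendChild(script);
  });
}

// Usage: loadChunk("data/pages-chunk-1.js").then((chunk) => console.log(chunk.rows.length));

export {};
```
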
### CSV Export

When using `--export-csv`, the tool generates Excel-compatible CSV files in the `csv/` subdirectory:

- all-pages.csv: Complete page data with all metrics
- links.csv: All links from all pages (internal/external, with anchor text)
- images.csv: All images from all pages (with alt text status)
- headings.csv: All headings from all pages (with levels)
- issues.csv: All issues by page with severity levels

CSV files are RFC 4180 compliant and can be opened in Excel, Google Sheets, or any spreadsheet application.
#### CSV Columns
- all-pages.csv: url, status, title, titleLength, metaDescription, descriptionLength, h1Count, wordCount, internalLinks, externalLinks, images, imagesWithoutAlt, responseTime, redirects, canonicalUrl, robotsDirectives, issuesCount, issues
- links.csv: pageUrl, linkUrl, anchorText, rel, isInternal, isNofollow, status
- images.csv: pageUrl, imageSrc, altText, hasAlt, fileSize
- headings.csv: pageUrl, level, text
- issues.csv: pageUrl, issue, severity

## Architecture
The project is organized into modular components:
```
src/
├── cli.ts          # CLI entry point with Commander
├── crawler.ts      # Website crawling with performance tracking
├── parser.ts       # Comprehensive HTML metadata extraction
├── analyzer.ts     # Advanced SEO issue detection and categorization
├── reporter.ts     # HTML report generation with Handlebars
├── exporter.ts     # CSV export functionality (NEW)
├── types.ts        # TypeScript type definitions
└── utils/
    └── urlUtils.ts # URL normalization and filtering

templates/
├── summary.hbs     # Interactive tabbed summary with sortable tables (NEW)
└── page.hbs        # Enhanced page detail template with all metrics (NEW)
```

### Design Principles
1. Separation of Concerns: Each module has a single, well-defined responsibility
2. Memory Efficiency: Pages are parsed immediately after fetching; only metadata is stored
3. Error Resilience: Network and parsing errors don't stop the entire crawl
4. Extensibility: Modular design allows easy addition of features like JS rendering or new output formats
## Technology Stack
- TypeScript: Type-safe development
- Axios: HTTP client for page fetching
- htmlparser2 + css-select: Fast DOM-lite HTML parsing (low memory, high throughput); see the sketch after this list
- Commander: CLI framework
- Handlebars: HTML templating
- p-limit: Concurrency control
- Chalk & Ora: Beautiful CLI output
## SEO Best Practices
This tool is based on SEO best practices from:
- Google's Search Central documentation
- Industry-standard character limits for titles (60 chars) and descriptions (160 chars)
- Common SEO audit methodologies used by tools like Screaming Frog, Ahrefs, and SEMrush
### Key Recommendations
- Unique Titles & Descriptions: Every page should have unique, descriptive metadata
- Optimal Length: Titles should be 20-60 characters, descriptions 50-160 characters
- Canonical Tags: Use self-referential canonicals to avoid duplicate content issues
- Robots Directives: Avoid conflicting directives; verify noindex pages are intentional
- Structured Data: Ensure JSON-LD is valid JSON and properly formatted
- hreflang: For multilingual sites, implement reciprocal hreflang tags
## Future Enhancements
Possible future enhancements:
- 🔄 JavaScript Rendering: Support for SPAs using Puppeteer/Playwright
- 🤖 robots.txt Compliance: Automatic robots.txt parsing and adherence
- 🔗 Advanced Link Checking: Actually validate external links (not just detect 404s)
- 📊 Progress Tracking: Real-time crawl progress with ETA
- 🎨 Custom Report Themes: User-configurable report styling
- 🔌 Plugin System: Allow custom analyzers and reporters
- ☁️ Cloud Integration: Deploy as a web service or integrate with CI/CD pipelines
- 📈 Historical Tracking: Compare crawls over time to track improvements
- 📋 Advanced Schema Validation: Validate JSON-LD against schema.org types
- 📱 Mobile vs Desktop: Compare mobile and desktop rendering
## Comparison to Screaming Frog
This tool now implements 85-90% parity with Screaming Frog's core SEO analysis features (excluding external API dependencies):
| Feature | This Tool | Screaming Frog |
|---------|-----------|----------------|
| **Core Analysis** | | |
| Page crawling | ✅ | ✅ |
| Title/Description analysis | ✅ (+ pixel width) | ✅ |
| Heading extraction (H1-H6) | ✅ (+ duplicates) | ✅ |
| Image alt text analysis | ✅ | ✅ |
| Internal/External links | ✅ | ✅ |
| Response times & redirects | ✅ | ✅ |
| Content metrics | ✅ (+ readability) | ✅ |
| Canonical URL analysis | ✅ (detailed) | ✅ |
| Robots directives | ✅ | ✅ |
| hreflang validation | ✅ (partial) | ✅ |
| **Advanced Analysis** | | |
| Security analysis | ✅ (HTTPS, headers, mixed content) | ✅ |
| URL quality checks | ✅ (15+ checks) | ✅ |
| Duplicate content detection | ✅ (exact + near) | ✅ |
| Orphan page detection | ✅ | ✅ |
| Weak anchor text | ✅ | ✅ |
| HTML validation | ✅ (structure, DOM depth) | ✅ |
| Pagination analysis | ✅ (partial) | ✅ |
| Soft 404 detection | ✅ | ✅ |
| Lorem ipsum detection | ✅ | ✅ |
| **Export & Reporting** | | |
| CSV export | ✅ (5+ files) | ✅ |
| Interactive HTML reports | ✅ | ❌ (static) |
| Severity-based filtering | ✅ | ✅ |
| **Additional Features** | | |
| Free & open source | ✅ | ❌ (freemium) |
| Command-line interface | ✅ | ✅ (paid) |
| Readability metrics | ✅ (3 formulas) | ❌ |
| Content-to-code ratio | ✅ | ✅ |
| JavaScript rendering | ❌ | ✅ |
| robots.txt validation | ✅ | ✅ |
| Sitemap analysis | ✅ | ✅ |
| PageSpeed/Lighthouse | ❌ | ✅ (paid) |
| Google Search Console | ❌ | ✅ (paid) |
| Google Analytics | ❌ | ✅ (paid) |
| External link checking | ❌ | ✅ |
Summary: This tool implements 220+ SEO checks covering on-page SEO, content quality, security, URL quality, link analysis, schema validation, robots.txt compliance, and sitemap analysis. It excels at static HTML analysis but doesn't include JavaScript rendering or external API integrations (PageSpeed, GSC, GA). See docs/SCREAMING_FROG_PARITY.md for details.

### JavaScript Rendering Limitation
⚠️ Important: This crawler analyzes static HTML only (like Screaming Frog's default mode). It does not execute JavaScript.
Impact on External Scripts Detection:
- ✅ Detects scripts in the initial HTML (