Methodology
Cited samples up to 5 pages from your site and measures the visible word count on each. The average across the sample determines the score. We don’t penalize occasional short pages — a contact page or thin landing page won’t fail the signal on its own — but consistently thin content across the sample drags the average down. The scoring tiers are calibrated against AI citation patterns observed in production. Pages averaging:- Over 800 words earn the full 6/6 — this is the band where AI models confidently treat content as a single authoritative source.
- 500-800 words earn 4/6 — substantive but typically requires AI to combine your page with another source for full coverage.
- 300-499 words earn 2/6 — usually too thin for primary citation; AI may cite you as a supporting source after citing a more detailed page.
- Under 300 words earn 1/6 — flagged as critically thin. AI models routinely skip these in favor of competitors with deeper content.
Verification
You can verify our finding yourself with a one-line browser check. Step 1: Open the pages we sampled. Cited reports the URLs we tested. Open each in a new browser tab. Step 2: Count words via the console. Open DevTools (Cmd+Option+I / Ctrl+Shift+I), Console tab, and run:
Technical detail
Word count as a depth proxy traces to early SEO research (Backlinko, Ahrefs studies from 2016 onward) showing strong correlation between content length and search ranking. Modern AI citation studies show a similar pattern: AI models routinely prefer 1,000+ word sources over 300-word sources for substantive queries. The thresholds Cited uses are conservative — 800 words is well above the 600-word minimum many studies cite — to favor pages that clearly clear the bar over edge cases. Extraction logic. The scanner counts words on the rendered page:document.body.innerTextreturns the page’s visible text content after JavaScript hydration- The text is trimmed and split on whitespace (
/\s+/) - The resulting array length is the word count
- Empty pages (no visible text) report 0 words
avg > 800 → 6, avg >= 500 → 4, avg >= 300 → 2, else → 1. No interpolation between tiers — a 799-word average and a 600-word average both score 4.
Edge cases the scanner handles:
- Pages with no visible text — single-page apps that haven’t hydrated, blank error pages, paywall walls. These contribute 0 words to the average but still count as sampled pages, dragging the average down.
- Pages with massive chrome — sites with very long footers (link sitemaps, comprehensive nav) inflate the word count without contributing editorial depth. The scanner can’t distinguish.
- Pages with images-as-content — infographics, product detail images, photo galleries. The alt text and any visible captions count, but the visual content doesn’t. Image-heavy pages tend to underscore.
- Hidden content — text inside
display: noneor visibility-hidden containers doesn’t count becauseinnerTextexcludes hidden content by design. This includes collapsed accordion content visible only after a click. - Iframes — content inside iframes (embedded videos, third-party widgets, YouTube embeds) doesn’t count because each iframe is a separate document context.
- Content quality or accuracy. A page can be 2,000 words of off-topic ramble and score 6/6. The scanner counts words, not insight.
- Main-content depth specifically. Pages with thin articles but rich navigation score higher than they should. This is a known limitation.
- Unique vs duplicate content. A page with 1,200 words copied from another page on your site scores the same as 1,200 words of original content. Duplicate content is a separate concern that we don’t currently model.
- Reading level or readability. Word count is independent of how readable the content is. A 1,000-word page written at a PhD reading level scores the same as a 1,000-word page written for general audiences.