Word count, headings, paragraphs (structural dimensions):
Word count: Word count tiers are calibrated from Zhang et al. (2026, preprint — not yet peer-reviewed), which found high-influence pages were on average 11.44× longer than low-influence pages. The specific word count thresholds are the tool's internal calibration.
Headings: Heading count tiers are calibrated from Zhang et al. (2026, preprint — not yet peer-reviewed), which found high-influence pages had 12.50× more headings. Specific tier thresholds are the tool's internal calibration.
Paragraphs: Paragraph count tiers are calibrated from Zhang et al. (2026, preprint — not yet peer-reviewed), which found high-influence pages had 5.69× more paragraphs. Specific tier thresholds are the tool's internal calibration.
A separate preprint study (arXiv:2603.29979, 'Structural Feature Engineering for Generative Engine Optimization', 2026) provides independent corroboration for these structural dimensions. That study tested structurally distinct variants of the same content — identical semantic payload, only structure changed — across six generative engines, and found structural changes alone produced a mean 17.3% improvement in citation rate. This is controlled-experimental evidence that structural features have a measurable effect on citation outcomes, independent of semantic content changes.
Definition sentences:
Definition sentence detection is based on Zhang et al. (2026, preprint — not yet peer-reviewed), which found pages with high definitional content showed approximately 57% higher absorption. This is a page-level finding applied as a document-level dimension — an informed inference, not a directly measured sentence-level effect.
Comparative sentences:
Comparative sentence detection is based on Zhang et al. (2026, preprint — not yet peer-reviewed), which found comparative content was associated with approximately 55% higher absorption. Page-level finding applied as a document-level dimension.
Statistics & numeric evidence:
Statistics presence is associated with approximately 61% higher absorption and code/numeric content with approximately 76.88% higher absorption in Zhang et al. (2026, preprint — not yet peer-reviewed). These are absorption-side measurements: they measure how much a source shapes AI-generated answer language. A separate peer-reviewed study (Aggarwal et al. 2024) found approximately +31% for citation selection probability — this is a different measurement type on a different phenomenon. The two figures are not comparable and are not presented alongside each other in this tool's scoring.