Scanner Benchmark Methodology Plugins Blog About Contact v0.15.1

Scoring Methodology

How FastAEOCheck measures your website's readiness for AI answer engines and generative engines. Every check, every point, and the research behind the weights.

Last updated: February 3, 2026 · Methodology v0.5

How the AEO Score works

Your AEO Score is a number between 0 and 100 that measures how well your website is optimized for AI answer engines — systems like ChatGPT, Google Gemini, Perplexity, and Claude that pull information from the web to answer user questions.

Unlike traditional SEO, which focuses on ranking in search results, AEO focuses on being cited, extracted, and referenced by AI systems. A site can rank #1 on Google and still score poorly on AEO if its content isn't structured for machine comprehension.

The score is a weighted sum of 7 categories, each targeting a distinct aspect of AI readiness. A separate GEO Score (Generative Engine Optimization) is displayed alongside, showing how ready your site is to be discovered and cited by AI-powered search engines.

CategoryMax PointsWeight
Structured Data & Schema2020%
Content Quality1818%
Technical SEO1717%
GEO Readiness 🆕1515%
AEO Readiness1212%
Authority & Trust1010%
Content Structure88%
Total100100%

Grade scale

Each category and the overall score maps to a letter grade:

A+90-100
A80-89
B70-79
C60-69
D50-59
F0-49

Category breakdown

📊 Structured Data & Schema

20 pts

JSON-LD schema markup provides explicit, machine-readable signals that AI systems use directly — not interpreted, not guessed, but consumed as structured facts.

5 ptsSchema.org markup present — Any valid JSON-LD or microdata detected on the page
5 ptsFAQPage schema — Q&A pairs that AI can extract and cite directly
3 ptsOrganization schema — Brand name, description, contact info structured for AI
7 ptsAdditional content schemas — Product, Article, Review, and HowTo schemas (2 pts each, capped at 7). Sites with 3+ schema types show ~13% higher AI citation likelihood.
Why 20%? Structured data is the single most direct signal you can give AI systems. Unlike content that requires natural language processing to understand, schema markup is unambiguous. Research by Conductor found that pages with structured data are 2-3x more likely to appear in rich results and AI-generated snippets.

📝 Content Quality

18 pts

Content depth, metadata, and on-page signals that determine whether AI systems consider your content worth citing.

4 ptsWord count / content depth — 800+ words for content sites, 500+ for product sites. Comprehensive content is cited more frequently.
3 ptsMeta description quality — Present and 120-160 characters. AI systems use meta descriptions to understand page purpose.
2 ptsTitle tag optimization — Present and 30-65 characters. Primary signal for AI topic understanding.
2 ptsH1 heading — Single, clear H1 that states the page's main topic
2 ptsH2 subheadings — At least 2 H2s indicating content has organized subtopics
5 ptsInner page metadata quality — Multi-page scans only. Bonus for inner pages with optimized meta descriptions (+0.5 pts/page) and title tags (+0.3 pts/page), capped at 5 pts. Penalizes missing metadata, thin content, and missing H1s.

For single-page scans, the maximum is 13 pts (homepage checks only). Multi-page scans can earn up to the full 18 pts with consistent metadata across all pages.

Why 18%? AirOps research found that pages with clean heading hierarchy and aligned schema earned 2.8× higher AI citation rates. Over 70% of AI-cited pages were updated within the last 12 months, indicating AI systems strongly prefer substantial, maintained content. Multi-page sites with consistent metadata score higher because AI crawlers evaluate site-wide quality, not just individual pages.

⚙️ Technical SEO

17 pts

The infrastructure that determines whether AI crawlers can find, access, and process your content at all. If technical access is broken, nothing else matters.

3 ptsrobots.txt — Present and not blocking AI crawlers. Controls bot access to your content.
3 ptsXML sitemap — Present and accessible. Helps AI crawlers discover all your content.
3 ptsPage load time — Under 1 second for full marks, under 3 seconds for partial. Fast sites are crawled more frequently.
3 ptsLanguage declarationlang attribute on HTML tag helps AI serve content to the right audience.
3 ptsImage alt text — 90%+ of images have alt text. AI systems use alt text for image understanding.
2 ptsCanonical URL — Set to prevent duplicate content issues for AI crawlers.
Why 17%? AirOps' AEO implementation checklist ranks technical access as the #1 priority: "Technical access issues block everything else — fix first." A site with great content but broken crawlability will never appear in AI answers.

🤖 AEO Readiness

12 pts

Signals specifically designed for AI answer engines — content patterns that match how users query AI systems, not traditional search engines.

5 ptsFAQ content — Dedicated Q&A sections detected via HTML patterns or FAQPage schema. The single most effective AEO tactic.
3 ptsQuestion-based headings — H2s starting with "What", "How", "Why", "When", "Who" or ending with "?" — matching natural AI query patterns.
2 ptsClear value proposition — OG description or substantial meta description providing concise, extractable summaries.
2 ptsOpen Graph tags — og:title and og:description present for when AI systems share or reference your content.
Why 12%? CXL research shows that FAQ content with question-and-answer structure captures featured snippets at 3x the rate of paragraph content. HubSpot's analysis confirms that question-format headings are the primary matching pattern for conversational AI queries. The llms.txt check has moved to the new GEO Readiness category.

🏗️ Content Structure

8 pts

How well your content is organized for machine parsing — heading hierarchy, linking structure, and topical coverage.

3 ptsHeading hierarchy — H1 → H2 → H3 structure with clear topical organization. AirOps research shows pages with clean heading hierarchy earn 2.8× higher AI citation rates.
2 ptsInternal linking — 5+ internal links connecting related content across your site
1 ptExternal links — Outbound links present indicating connected, referenced content
1 ptMulti-page coverage — Multiple content pages covering different facets of your topic (multi-page scans only)
1 ptMultilingual support — Hreflang tags for international AI query coverage (multi-page scans only)

For single-page scans, the maximum adjusts to 6 pts since multi-page coverage and hreflang checks cannot be evaluated.

Why 8%? Content structure is the scaffolding that makes other signals work. A well-structured page with clear heading hierarchy is easier for AI to parse into extractable sections, but structure alone without substance has limited value.

🌐 GEO Readiness NEW

15 pts

Generative Engine Optimization — whether AI-powered search engines like ChatGPT, Perplexity, Google AI Overviews, and Claude can access, understand, and choose to cite your content. While AEO focuses on content structure for extraction, GEO focuses on whether AI will actually discover and reference your site.

4 ptsAI crawler access — Parses robots.txt for rules affecting 16 known AI bots (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and others). Full marks for allowing all, zero for blocking all. Reports exactly which bots are allowed vs. blocked.
4 ptsllms.txt quality — A /llms.txt markdown file following the llmstxt.org spec. Scored on existence, word count (100+), section depth (2+ headings), and inclusion of markdown links. A comprehensive file scores full marks.
4 ptsCitation-worthiness signals — Content that AI engines prefer to cite: statistics and data points (1 pt), source citations like "according to" (1 pt), clear definitions (1 pt), and structured content with lists or tables (1 pt). Based on Princeton GEO research showing 30-40% visibility boosts.
3 ptsContent extractability — Semantic HTML structure that makes content parseable by AI: proper H1→H2→H3 hierarchy (1 pt), HTML lists with 3+ items (1 pt), and HTML tables (1 pt).
Why 15%? Princeton University's GEO research found that citation-style content boosts AI visibility by 30-40%: statistics increase citation likelihood by 37%, source references by 30%, and direct quotations by 28%. Meanwhile, blocking AI crawlers makes GEO impossible regardless of content quality. This category also drives the separate GEO Score displayed alongside the main AEO Score.

🛡️ Authority & Trust

10 pts

E-E-A-T signals — Experience, Expertise, Authoritativeness, and Trust. The qualitative indicators AI systems use to assess whether your content is credible enough to cite.

2 ptsHTTPS — Baseline trust signal. AI systems strongly prefer secure sites.
2 ptsAbout / Contact pages — Internal links to /about, /contact, /team, or /support detected. Transparency signals.
2 ptsContent freshness — datePublished and dateModified in JSON-LD, meta tags, <time> elements, or visible "Last updated" text.
2 ptsAuthor information — Meta author tag, Person schema, .author/.byline elements, or [itemprop="author"] markup.
2 ptsExternal citations — 3+ outbound links to external sources. Indicates well-researched, referenced content.
Why 10%? E-E-A-T is a qualitative trust overlay, not a hard ranking factor. It matters most for YMYL (Your Money Your Life) topics like health and finance. For most sites, it's the differentiator between sites that could be cited and sites that will be cited. AirOps found that 60% of AI Overview citations come from pages NOT in the top 20 organic results — meaning authority signals and content extractability matter more than traditional SEO rank.

Research and data sources

Our scoring methodology is informed by research from multiple industry sources analyzing how AI systems select, extract, and cite web content:

Princeton University (2024) — Landmark GEO research analyzing how generative engines select sources to cite. Found that adding statistics boosts visibility by 37%, citing sources by 30%, and including quotations by 28%. Content with citation-worthy signals consistently outperformed optimized-for-SEO-only content in AI-generated responses.

AirOps (2025) — Analysis of AI Overview citation patterns found that 60% of cited pages are outside the top 20 organic search results. Content freshness is critical: 70%+ of citations reference pages updated within 12 months.

Conductor — Research on structured data impact shows pages with schema markup are significantly more likely to appear in rich results and AI-generated snippets, with FAQPage schema showing the strongest citation correlation.

HubSpot — Content analysis demonstrates that long-form content (1,000+ words) receives 77% more backlinks. Question-format headings match conversational AI query patterns at higher rates than declarative headings.

CXL — FAQ content with explicit question-and-answer structure captures featured snippets at 3x the rate of unstructured paragraph content, making it the highest-impact single optimization for AI visibility.

First Page Sage / Hashmeta / SEO Grow — Cross-industry analysis confirming the five pillars of AEO: structured data, content quality, technical fundamentals, E-E-A-T authority signals, and answer-formatted content.

Single-page vs. multi-page scans

FastAEOCheck adapts its scoring based on scan depth. A single-page scan (homepage only) adjusts the maximum achievable score for categories where multi-page data is needed:

Content Quality: Max adjusts from 18 to 13 pts. A single homepage cannot be penalized for missing inner page metadata.

Content Structure: Max adjusts from 8 to 5 pts. Multi-page coverage and hreflang checks are excluded.

This ensures a well-optimized homepage can still achieve an A+ grade on a single-page scan without being unfairly penalized for missing multi-page signals.

Product sites vs. content sites

FastAEOCheck detects whether your site is a SaaS/product/e-commerce page or a content/editorial site, and adjusts Content Quality thresholds accordingly:

Product sites: 500+ words = full depth score, 150+ = partial. Product pages are naturally less text-heavy.

Content sites: 800+ words = full depth score, 300+ = partial. Editorial content should be comprehensive to compete for AI citations.

Data sources & benchmarks

FastAEOCheck benchmark data — including the European web benchmark and industry-specific audits — is built on domain rankings from the Tranco research list, a research-oriented top sites ranking designed to be hardened against manipulation. Tranco aggregates data from four independent providers: Cloudflare Radar, Majestic, the Chrome User Experience Report (CrUX), and Cisco Umbrella.

The scoring methodology draws from industry research including AirOps (AI citation patterns, content freshness data), Conductor (structured data impact), HubSpot (content depth and linking analysis), CXL (FAQ snippet capture rates), and First Page Sage / Hashmeta / SEO Grow (AEO pillar frameworks).

Academic citation: Le Pochat, V., Van Goethem, T., Tajalizadehkhoob, S., Korczyński, M., & Joosen, W. (2019). Tranco: A Research-Oriented Top Sites Ranking Hardened Against Manipulation. Proceedings of the Network and Distributed System Security Symposium (NDSS 2019). doi:10.14722/ndss.2019.23386

Check your AEO & GEO Score now

Free instant audit across all 7 categories with detailed findings, actionable recommendations, and a dedicated GEO readiness score.

Scan Your Site →

Frequently asked questions

How often should I re-scan my site?

After making changes based on your audit recommendations, re-scan to verify improvements. For ongoing monitoring, monthly scans catch regressions and track progress as AI systems evolve.

Why does my site score differently than Google PageSpeed or Lighthouse?

Different tools measure different things. PageSpeed measures performance. Lighthouse measures web vitals. FastAEOCheck measures AI answer engine readiness — whether AI systems can find, understand, and cite your content. A site can score 100 on Lighthouse and 20 on AEO if it has no structured data or FAQ content.

What's more important — fixing red (fail) or improving yellow (warning)?

Fix fails first, especially those marked HIGH impact. Technical access issues (robots.txt, HTTPS) block everything else. Then structured data. Then content optimizations. This matches AirOps' recommended implementation timeline: technical first (immediately), content structure (0-30 days), schema (30-60 days), authority building (60-90+ days).

Does a high AEO score guarantee AI will cite my site?

No. AEO readiness increases your probability of being cited, but AI systems also consider content relevance, topical authority, and the specific query being asked. Think of AEO optimization as removing barriers — a high score means you've eliminated the technical and structural reasons an AI might skip your content.

How is AEO different from SEO?

SEO optimizes for ranking position in search results. AEO optimizes for being cited and extracted by AI answer engines. Key differences: AEO prioritizes structured data over backlinks, values answer-formatted content over keyword density, and focuses on extractability over click-through rates. Many SEO best practices help AEO, but AEO has unique requirements like FAQ schema and question-based headings.

What is GEO and how does it differ from AEO?

GEO (Generative Engine Optimization) focuses on whether AI-powered search engines like ChatGPT, Perplexity, and Google AI Overviews will discover and cite your content. While AEO focuses on structuring content for extraction, GEO addresses the upstream questions: can AI crawlers access your site, does your content contain citation-worthy signals (statistics, sources, definitions), and is your HTML semantically structured for AI parsing? The GEO Score is displayed separately alongside the main AEO Score.