Why six metrics and not one composite score?

Each of the six captures a property the others do not, and the metrics often disagree on the same source for the same prompt set. A composite score averages those disagreements into one number that hides which engine is moving and which content property is responsible. The trade-off cost (six dashboards instead of one) is justified when measurements drive content decisions; the diagnostic information is in the per-metric breakouts, not in the average.

Do the six metrics work the same across engines?

No, and the cross-engine asymmetries are large enough that a per-engine breakout is structural, not optional. Citation match rate effectively equals attribution rate on Google AI Overview (every source-panel entry is linked) but diverges on ChatGPT / Claude / Gemini where link state depends on session settings. Per-engine measurement differences show up in adjacent data too: in Ahrefs' August 2025 long-tail study, the overlap between AI-cited URLs and Google's top 10 for the same query was ~29% on Perplexity vs ~7-8% on ChatGPT / Gemini / Copilot. The exact gap varies by which metric you compute, but the underlying point holds: one aggregated cross-engine number hides the structural differences that drive optimization decisions. Each anchor entry's per-engine notes spell out the engine-specific caveats.

How do the six metrics relate to traditional SEO metrics?

Loose analogies, not direct replacements. Attribution rate is closest to impression share but at the AI-response level instead of SERP level. Citation share is the AI-search analog of share-of-voice but built on citation events instead of impression positions. Citation match rate is closest to a 'linked vs unlinked mention' split (which has no exact SEO counterpart). Cite-ability is closest to content-quality scoring but operationalized for AI extraction, not human readability. Citation velocity and citation rotation have no clean SEO counterparts: SERP ranks are tracked per query but not as time-derivatives or temporal-stability measures at the citation level.

Are any of the six defined in vendor or academic literature?

None of the six AI-search operationalizations are defined in current vendor documentation or peer-reviewed academic literature. Citation velocity inherits the metric shape from academic bibliometrics (Garfield 1955 onward) but the AI-engine application is recent and practitioner-coined. The other five are practitioner-coined for the AI search context; several vendor blogs (UltraScout, Rankeo, Steakhouse, Profound, Otterly, Peec) publish related framings, typically with looser operational definitions than this cluster uses. Tools differ on counting rules; cross-tool numbers should not be assumed apples-to-apples.

/terms/ai-citation-metrics · 21 min read · intermediate

AI citation metrics

Cluster pillar

AI citation metrics for GEO measurement organize into six independent dimensions: three output ratios (attribution rate, citation share, citation match rate), one input content property (cite-ability), and two temporal signals (citation velocity, citation rotation). Each captures a property the others do not. Each requires per-engine breakout to be useful. None is defined in vendor documentation or peer-reviewed academic literature for the AI-search context; the cluster is practitioner-coined and synthesized across GEO measurement practice.

Citation status

ChatGPT0×Perplexity·Claude0×Copilot0×Gemini0×

Last checked 2026-06-29

Why one number isn't enough

Most GEO measurement programs reach for a composite "AI visibility score" as the headline KPI; Profound, Peec AI, Otterly, AthenaHQ, and Ahrefs Brand Radar all surface composite-style headline scores in their public marketing. Composite scores are convenient, but they hide the structural per-engine gaps and content-property dynamics that drive citation decisions, and they collapse measurements that often disagree on the same source for the same prompt set. The cross-engine asymmetries are large enough that aggregation hides what matters: in Ahrefs' August 2025 long-tail study, the percentage of AI-cited URLs that also ranked in Google's top 10 for the same query was ~29% on Perplexity vs ~7-8% on ChatGPT / Gemini / Copilot¹. Whatever metric you choose, an aggregated cross-engine number averages signals from systems that behave structurally differently. The diagnostic information is in the per-metric, per-engine breakouts, not in the average.

GEO Glossary tracks the AI citation surface as six independent measurement dimensions, organized into three groups:

Output ratios (three metrics): how often, against whom, and how cleanly your source is cited.
- Attribution rate: the percentage of evaluated AI-engine responses that cite a specific source for a fixed prompt set.
- Citation share: the relative percentage of citations a source receives versus competitors for the same prompt set.
- Citation match rate: the percentage of AI-engine references to a source that include a clickable link back.
Input content property (one metric): the upstream property that makes a passage extractable, quotable, and attributable.
- Cite-ability: a practitioner-coined content property describing how suitable a passage is for AI extraction, quotation, and attribution.
Temporal signals (two metrics): how the citation surface changes over time.
- Citation velocity: the rate at which new AI-engine citations to a source accumulate over time; a temporal leading indicator paired with attribution rate (different units: a count per window vs a ratio), useful when attribution rate is flat but underlying citation flow is changing.
- Citation rotation: the rate at which the sources an AI search engine cites for a given query change over time; the temporal-stability dimension.

None of the six AI-search operationalizations is defined in current vendor documentation or peer-reviewed academic literature²³. Each definition below is practitioner-coined and synthesized across GEO measurement practice. Each requires a per-engine breakout to be useful. Each has a denominator decision that needs to be locked in advance and held stable across measurement windows.

The six metrics in detail

The rest of this pillar walks each metric in turn: what it measures, the operational decisions that define it, and what it does not capture. The anchor entry for each metric is the authoritative reference for the deep definition, the verbatim formula, and the per-engine caveats; this pillar focuses on how the six fit together as a measurement framework.

Output ratios

Attribution rate: the headline KPI

Attribution rate answers the simplest measurement question: was my source cited? It is the percentage of evaluated AI-engine responses that cite a specific source or domain for a defined prompt set. The metric is conceptually related to the citation-counting and visibility measures in the 2023 Princeton GEO paper², though the per-engine and per-query-set operationalization here is the cluster's own extension, not the paper's.

The formula: (responses that cite the target source) ÷ (total evaluated responses) × 100%.

Three denominator decisions to lock in advance: per engine vs aggregated (always report per engine first; engines differ structurally and aggregation hides it), all probed responses vs only responses with at least one citation (the stricter denominator is more cross-engine comparable), and verbatim linked citation vs name mention (tighter definitions require the engine to actually link the source).

Attribution rate is the simplest entry point into the measurement framework; it returns a usable per-engine baseline within one weekly probe round of a 10-query prompt set. It is also the metric most other metrics in this cluster depend on: citation share normalizes against the same prompt set's total citations, citation match rate normalizes against the same source's own attributed references, citation velocity is its time-derivative, and citation rotation tracks the same probed source list across time.

Citation share: how big a slice of the pie

Citation share is the relative version: where attribution rate measures one source against all queries, citation share measures one source against other sources for the same queries. The formula: (target source citation instances) ÷ (total citation instances across all sources for those queries) × 100%.

Three additional axes to lock before reporting: URL vs domain vs brand level (the same entity yields three different shares depending on aggregation), deduplication rule (one count per response vs raw counts), and per-engine vs cross-engine vs traffic-weighted aggregation.

Citation share is the metric that supports competitive analysis. It is what tells you whether a topic is fragmented (no single source holds more than ~15% share, an enterable cluster) or concentrated (one domain holds 60%+ share, an entrenched canonical you should route around). It is the AI-search analog of traditional share of voice (not the direct equivalent: SEO share of voice is built on ranking impressions, citation share is built on counted citation events).

Citation match rate: the linked subset

Citation match rate counts only the linked subset of attribution: references that include a clickable URL back to the source. The formula: (linked citations) ÷ (all attributed references) × 100%.

The metric matters because linked and unlinked references have different downstream effects. A linked reference creates a direct click path; an unlinked reference contributes to brand awareness and may drive delayed effects (branded search, dark traffic, eventual direct visits) but does not create a click path. Reporting "mentioned vs cited" as a single binary loses the linked-prose-mention case where an engine names a brand inline without an attached URL.

Citation match rate effectively equals attribution rate on Google AI Overview (every source-panel entry is linked). On ChatGPT / Claude / Gemini the match rate depends primarily on whether web search or grounding is enabled per session, not on model version. Per-engine match-rate goals are the right framing; chasing one universal target collapses real engine-by-engine variation.

Input content property

Cite-ability: the upstream property the output ratios measure

Cite-ability is the only metric in the cluster that lives in the content, not in the measurement system. It is a practitioner-coined content property describing how suitable a passage is for AI extraction, quotation, and attribution. Where attribution rate / citation share / citation match rate measure the outcome of how engines treat the content, cite-ability measures the input property of the content itself.

There is no standardized cite-ability metric or scoring tool. Practitioners use a four-trait heuristic: a self-contained claim, an unambiguous subject, a sourceable assertion, and ideally memorable phrasing. The "context-free quote" test (copy a paragraph into a fresh AI chat with no surrounding context, ask the model to restate it) is the most common informal scoring method.

Cite-ability is the lever the other five output metrics react to. A program that improves attribution rate without changing cite-ability is leaning on distribution and authority, not on content. A program that improves cite-ability without seeing the output ratios move is either too early in the engine indexing cycle or working in a topic cluster already saturated by stronger sources. The two-sided signal (input property changes vs output ratio responses) is more diagnostic than either side alone.

Temporal signals

Citation velocity: the temporal leading indicator

Citation velocity is the rate at which new AI-engine citations to a source accumulate over time. The basic formula: citation velocity (period N) = new citation instances observed in period N (per fixed window length). The metric is often described as "the time-derivative of attribution rate", but the two have different units (a ratio vs a count per time) and the relationship is colloquial rather than a strict mathematical derivative. The practical claim is that velocity often moves before attribution rate does, not that one is the literal derivative of the other.

The metric inherits its shape from academic bibliometrics (Garfield 1955 onward³) but the AI-engine application is a recent extension. Several vendor blogs (UltraScout, Rankeo, Steakhouse) have published "citation velocity" definitions for AI search with looser, often single-window or competitor-relative framings; no major GEO tracking tool currently exposes velocity as a named field in a public-facing dashboard.

Velocity is the leading indicator to attribution rate's lagging indicator. A flat attribution rate over multiple weeks can mask both high-churn (new sources displacing old) and low-churn (stable repeats) realities. Velocity, computed as new-citation counts per probe round (distinct from acceleration, which is the period-to-period change in velocity), separates the two. Practitioner experience suggests attribution-rate shifts may be preceded by 2 to 6 weeks of velocity acceleration; this is a monitoring hypothesis, not a benchmark, and has not been empirically established in published research.

Three operational decisions: window length (weekly minimum because daily windows show too much noise from probe timing), novelty definition (a five-fold typology of first-seen / new-vs-previous-window / recovered / persistent / lost, not just a binary new vs persistent split), and per-engine breakout (engines have different ingestion lags; aggregate velocity hides single-engine spikes).

Citation rotation: the temporal-stability dimension

Citation rotation is the rate at which the sources an AI search engine cites for a given query change over time. In high-rotation windows, the cited-source set may change weekly or faster; in low-rotation windows, the same top sources persist for months. Rotation is the temporal-stability dimension that pairs with velocity's temporal-accumulation dimension.

The concept appears in the literature under several names. Citation volatility is the most common industry term (used for events like the widely-reported late-2025 episode in which ChatGPT's Reddit citation share fell from roughly 60% to roughly 10% over approximately six weeks, attributed by industry coverage to Google's removal of the num=100 search parameter). Source pool cycling and source rotation are alternative names; citation persistence is the inverse property. The citation-rotation entry uses the synthesis name for parallel framing with the rest of the cluster.

Three measurement disciplines: lock prompt set, cadence, and engine list before the first measurement window (rotation only shows up as signal if probes are consistent); categorize sources by persistence tier before computing rotation (top-tier authority sources often appear more persistent and dominate the share that does not rotate; mid-authority sources cycle on a weekly-to-monthly cadence in practitioner tracking); and track per-engine rotation curves over months, not weeks (single weekly snapshots show too much noise to separate rotation from random sampling variance).

The underlying driver of an observed rotation event is not always identifiable from publisher logs alone (it can involve retrieval, ranking, grounding-time selection, or UI sampling), so citation rotation is best understood as a measurement category, not a confirmed internal mechanism. No vendor has published a per-source rotation policy.

Which metric for which question

The six metrics answer different questions. A practitioner playbook for matching question to metric:

Question	Metric	Key consideration
Am I cited at all?	Attribution rate	Per-engine baseline first; aggregate hides the structural gap.
Am I cited more than competitors for this topic?	Citation share	Define the topic cluster first; freeze the query set for ~8 weeks.
Is my citation a real link or just a name mention?	Citation match rate	Record link state and mention type as two independent dimensions.
Why is this content getting cited (or not)?	Cite-ability	The only metric that lives in the content; the lever the others react to.
What is my new-citation count per period?	Citation velocity	Distinguish from acceleration (change in velocity between periods); store per-probe lists, not aggregate counts.
Will my citation persist next week?	Citation rotation	Per-engine; minimum useful sample is months, not weeks.

In this glossary's framing, two metrics typically anchor a GEO measurement program: attribution rate for the simple "am I cited" question, citation share for the competitive "where do I stand" question. The other four are diagnostic layers added once the basic baseline is stable. The decision is not "which one to pick" but "which one to add next" as the program matures.

What no single metric captures

Each of the six captures something the others do not, but none captures everything. Gaps that matter for GEO program decisions:

Engine-traffic-weighted value. Citation share weighted by each engine's traffic share is the most strategically accurate variant, but reliable per-engine traffic data is often unavailable. A 22% citation share on AI Overview (with high downstream traffic) is more valuable than a 22% citation share on Copilot (with lower downstream traffic), but no metric in the cluster captures this weighting natively. Practitioners proxy this with click-through tracking when available.
Click-through rate after citation. None of the six metrics measures whether a citation drives a click. AI citations often satisfy the user inside the answer without a referral visit. Track attribution rate (cited or not) and referral click-through rate (clicks per citation) as separate metrics; conflating them hides whether a low click-through reflects "not cited" or "cited but no click". Click-through measurement also requires solving external traffic disambiguation, which adds its own per-source measurement discipline.
Sustained vs spike quality. Citation velocity reports a fresh count per period, but a velocity spike that decays within two weeks is not the same as a sustained level shift. The cluster does not include a "sustained quality" metric; practitioners read this from the combination of velocity (rate of new), citation rotation (persistence), and attribution rate (current state) across multiple windows.
Cross-engine prompt-set alignment. Comparing citation share across engines assumes the same prompt set is meaningful on each surface. In practice, query reformulations differ across engines (Perplexity fan-out, AI Overview query suggestions, ChatGPT search query rewriting), and per-source citation differences can reflect engine-specific query handling rather than per-source performance. The cluster does not include a prompt-set normalization metric; this remains a known measurement caveat.
Reactive vs organic rotation drivers. The citation-rotation entry hedges that rotation drivers are partially explainable (vendor parameter changes, partnership announcements) and partially residual noise (natural index refresh churn). No vendor has published a per-source rotation policy, and the cluster has no metric that decomposes rotation into reactive vs organic components.
Cross-tool comparability at the denominator. A SaaS vendor reporting "your brand has been mentioned 200 times by ChatGPT this week" may be counting linked citations only, brand-name mentions in body prose regardless of links, or both. The six cluster metrics align around the Citation vs Mention vs Link taxonomy and label which cells they count; third-party tracking tools (Profound, Otterly, AthenaHQ, Peec AI, Ahrefs Brand Radar) each implement different counting rules and rarely surface the per-cell breakdown. Cross-tool numbers should not be assumed apples-to-apples without checking each tool's methodology page.
Cross-engine cited-set agreement. All six metrics measure one source's citation on a given engine (or its share against competitors); none measures whether two engines cite the same pages as each other for the same prompts. Under a fixed prompt set the engines' cited-source sets can overlap very little, so a citation on one engine is a weak predictor of a citation on the next. Low overlap is partly arithmetic (sparse citation sets overlap little by chance) and does not by itself prove the engines actively diverge, but it does mean a single blended "AI visibility" number averages across systems that mostly cite different pages. The Ahrefs cross-engine overlap figures above are one shadow of this; our engine-disjoint citation dispatch measures the cited-set overlap directly. No metric in the cluster quantifies cross-engine agreement natively.

Adoption sequencing playbook

For a GEO program building the measurement framework from zero, a sequenced adoption that matches the diagnostic value of each metric:

Week 1: attribution rate per engine. Lock a 10-query prompt set, probe weekly across ChatGPT / Perplexity / Claude / Copilot / Google (with Gemini chat, AI Mode, and AI Overview each probed separately and rolled up under Google for the rate calculation). Record cited URL, domain, position in the source list, the answer's verbatim wording, and whether the citation is linked. This single decision returns a per-engine baseline by the end of the first weekly probe round.
Continuously: citation match rate, as a free add-on. Citation match rate uses the same probe data as attribution rate; you are already recording link state per reference. Surface the match-rate breakout in the same weekly report.
Promote to content process: cite-ability. Cite-ability is not a measurement output; it is the content-side discipline that drives the others. Add the "context-free quote" test and the four-trait checklist to the content review process. Cite-ability moves slowly; the lead indicator that the discipline is working is improvement in citation velocity (next).
Week 9: citation share + citation velocity. By Week 9 you have eight weeks of attribution-rate data, which is enough to compute citation velocity (probe-to-probe deltas, store the per-probe citation lists not just aggregates) and to draw a competitive frame for citation share (define the topic cluster, tabulate competitor URL / domain / brand presence across the same prompt set).
Quarterly: citation rotation. Citation rotation needs months, not weeks, of data to separate signal from noise. At the end of the first quarter, compute per-engine rotation curves over the full 12 to 13 weekly probe rounds. Use the rotation report as the quarterly diagnostic for which surfaces require more frequent monitoring and refresh versus which appear more stable once authority and source fit are established. The trade-off is observational rather than vendor-prescribed.

What to skip at every stage: composite "AI visibility scores" that bundle the six metrics into one number. They average away the per-engine, per-metric gaps that the framework exists to surface.

How the cluster fits the broader GEO surface

The six metrics in this pillar form the citation-metrics cluster within the broader GEO measurement surface. Three adjacent clusters interact with this one:

Surface-family entries (AI Overview citation, AI Mode, Microsoft Copilot citations, Perplexity citation, Claude citation, Gemini citation, ChatGPT search citation, Brave Search citation, Grok citation, DuckDuckGo AI citation, Meta AI citation, and AI dev tool citations): each citation metric in this pillar must be computed per surface, not just per engine. The surface-family cluster describes the engine-specific measurement caveats that the per-engine breakouts in this pillar inherit. The next section provides a quick-reference comparison across the 12 surfaces on four practitioner-relevant axes.
Brand mentions in AI answers (brand mentions in AI answers): the broader presence dimension that includes unlinked name references. Brand mention rate often exceeds attribution rate because AI engines mention well-known brands even when they do not cite them. The brand-mentions entry sits adjacent to the citation-metrics cluster but is not one of the six core anchors.
Traffic-side measurement (external traffic disambiguation): the methodology for separating AI-driven referral visits from other traffic sources. Citation metrics measure the upstream event (was the source cited); traffic disambiguation measures the downstream event (did the citation drive a visit). Both sides of the funnel need their own measurement discipline.

The umbrella for all three clusters is generative engine optimization: the practice of writing, structuring, and signaling content for AI search engines. The citation-metrics cluster is the measurement layer of that practice; the surface-family cluster is the per-engine implementation layer; brand mentions and traffic disambiguation are the adjacent presence and outcome layers.

Cross-surface quick reference

The 12 citation surfaces in the cluster differ on four practitioner-relevant axes. A given citation-tracking program should not collapse them into a single "AI citation" rate; the structural differences below determine which metrics apply, what crawler rules to verify, and where to expect zero-sum competition for citation slots.

Inclusion criterion: an active surface anchor is a citation-bearing AI surface that (a) is in publicly available production as of 2026, (b) renders citations or source attribution visible to end users or downstream developers, and (c) has its own dedicated entry in this cluster. The count excludes deprecated surfaces (Search Generative Experience, which has been superseded by AI Overview and AI Mode) and foundational concept entries that are not citation surfaces themselves (the AI Overview feature page is the concept entry; AI Overview citation is the surface anchor counted below).

Surface	Index provenance	Crawler discipline (robots.txt)	Default citation rendering	Structurally unique feature
AI Overview	Google	Googlebot, Google-Extended	Linked source panel under generated answer	Highest reach Google-derived surface; tested on most queries
AI Mode	Google	Googlebot, Google-Extended	Linked sources alongside generated answer	Google's conversational-mode counterpart to AI Overview
Gemini	Google	Googlebot, Google-Extended	Linked sources in answer	Standalone Google chat product; same index family as AI Overview / AI Mode
Microsoft Copilot	Bing	Bingbot	Linked source-card panel	Bing-grounded; IndexNow push protocol applies upstream
ChatGPT search	Not fully vendor-documented (third-party reporting: Bing-grounded behavior at points)	OAI-SearchBot, GPTBot, ChatGPT-User	Inline citations + source-card panel	Five product surfaces (chatgpt.com, Desktop, mobile, Atlas browser, API)
Perplexity	Hybrid (own crawl + partner indexes)	PerplexityBot	Prominent inline citations over a wide source pool (~10-20+ sources)	Widest citation pool in the cluster; lowest per-slot competition
Claude	Multi-provider (web tool + connectors)	ClaudeBot, anthropic-ai, claude-web	Linked sources in tool-augmented answers	First-party crawler documentation; chat product not search-first
Brave Search	Own independent index	Brave's own crawler	Linked sources beneath AI Answers and Ask Brave	"Only independent search index at scale outside of Big Tech" (Brave)
Grok	Own web index + native X (Twitter) integration	Opaque: documented user agents (GrokBot, xAI-Grok, Grok-DeepSearch) not seen in actual traffic; observed retrieval from proxy/datacenter IPs with spoofed Chrome/Safari UAs	Inline citations in WebSearch and DeepSearch; visible reasoning trace in DeepSearch	Only cluster anchor where robots.txt is effectively unenforceable; only anchor with native X-platform content as first-class source
DuckDuckGo Search Assist	Own (DuckAssistBot)	DuckAssistBot	1-2 linked sources only beneath summary; configurable frequency	Narrowest citation slot in cluster (zero-sum competition tightest); Duck.ai is separate third-party-model chat proxy
Meta AI	Tier-gated: licensed publishers (Dec 2025 deals) + general web (backend mix not fully documented) + Instagram / Facebook / Threads public posts	Meta-ExternalAgent (training), Meta-ExternalFetcher (retrieval); respect robots.txt	Tier-gated: licensed partners get linked attribution on news queries; non-licensed and non-news queries may be summarized without inline links	Only cluster anchor where citation behavior depends on the publisher's commercial relationship with the vendor; only anchor with Meta-internal social posts as first-class source
AI dev tool citations	Varies by tool (Cursor / Cline / Codex / Copilot etc.)	Varies by tool	Varies (inline citations in some; no citations in others)	Code-context retrieval rather than web search; different measurement surface than consumer AI search

Cross-cutting observations:

Citation slot count is structural, not just a rendering choice. DuckDuckGo Search Assist's 1-2 source commitment makes per-query citation share competition tighter than Perplexity's ~10-20 slot pool by an order of magnitude. Citation-share targets should be calibrated to slot count per surface.
Crawler controllability is not uniform. OpenAI, Anthropic, Google, Microsoft, and DuckDuckGo all publish first-party crawler documentation with stable user-agent strings and robots.txt contracts. Grok is the outlier: observed retrieval traffic spoofs browser UAs and rotates through proxy / datacenter IPs, making robots.txt rules effectively unenforceable. Meta-ExternalAgent and Meta-ExternalFetcher respect robots.txt, though Meta's first-party crawler documentation is less prominent than peer vendors'.
Licensing-tiered citation is unique to Meta. No other cluster anchor gates citation behavior on a publisher's commercial relationship with the vendor; for news queries on Meta AI, attribution rate against your domain partly depends on whether you are a 2025-12 licensing partner (CNN, Fox News, Fox Sports, USA Today, People Inc, Daily Caller, Washington Examiner, Le Monde Group).
Native non-web source categories matter for measurement. Grok cites X posts as a first-class category; Meta AI cites Instagram, Facebook, and Threads public posts. These do not appear in web-domain citation tracking and require their own measurement columns. A "Grok or Meta AI citation rate" computed only against web URLs systematically undercounts.
A citation is attribution, not factual correctness. Across all 12 surfaces, a citation event indicates that the engine surfaced a source; it does not indicate that the cited source is accurate, that the source's content was used correctly, or that the cited claim is not a hallucination. See hallucination grounding for the broader framing.

Louise Linehan & Xibeijia Guan, "Only 12% of AI Cited URLs Rank in Google's Top 10 for the Original Prompt," Ahrefs Blog, 2025-08-11. ahrefs.com/blog/ai-search-overlap. The 12% headline averages five measurements: Perplexity 28.6%, ChatGPT (in-text) 8%, ChatGPT (references) 6.1%, Gemini 8.6%, Copilot 8.2%. The structural per-engine gap (Perplexity at ~29% vs ChatGPT / Gemini / Copilot at ~7-8%) is the cross-engine measurement asymmetry this pillar's per-engine framing inherits. ↩
Aggarwal et al. "GEO: Generative Engine Optimization." arXiv:2311.09735, November 2023. Introduces the GEO research framework and tests nine content-modification methods against a Position-Adjusted Word Count metric and a Subscription Impression metric. This cluster uses the paper as a starting point but redefines citation counting in per-engine, per-query-set terms that the paper does not specify; "attribution rate" as defined in this cluster is not the paper's term. ↩ ↩²
Eugene Garfield. "Citation Indexes for Science: A New Dimension in Documentation Through Association of Ideas." Science 122(3159):108-111, July 1955. DOI 10.1126/science.122.3159.108. Garfield founded the Institute for Scientific Information in 1960 and launched the Science Citation Index in 1963; ISI later became part of Clarivate (Web of Science). Citation velocity in the bibliometrics sense (rate of new citations to a paper over time) emerged from this lineage as a standard measure. The AI-engine version of the metric in this pillar's velocity section uses the same metric shape but applies it to AI-engine citation surfaces rather than scholarly publication surfaces. ↩ ↩²

Part of Citation metrics· editorial cluster, not a semantic link

Also in this cluster: AI visibility · Attribution rate · Brand mentions in AI answers · Citation Footprint · Citation match rate · +5 more

Mentioned in· auto-generated from other terms' related lists

FAQ

Why six metrics and not one composite score?: Each of the six captures a property the others do not, and the metrics often disagree on the same source for the same prompt set. A composite score averages those disagreements into one number that hides which engine is moving and which content property is responsible. The trade-off cost (six dashboards instead of one) is justified when measurements drive content decisions; the diagnostic information is in the per-metric breakouts, not in the average.
If I had to start with one metric, which is it?: Attribution rate, computed per engine, against a fixed 10-query prompt set. It is the simplest measurement (cited or not), it requires no comparison set, and it returns a usable per-engine baseline within one weekly probe round. Add citation share at week 9 once you have eight weeks of attribution-rate data to draw a competitive frame against; add the rest incrementally.
Do the six metrics work the same across engines?: No, and the cross-engine asymmetries are large enough that a per-engine breakout is structural, not optional. Citation match rate effectively equals attribution rate on Google AI Overview (every source-panel entry is linked) but diverges on ChatGPT / Claude / Gemini where link state depends on session settings. Per-engine measurement differences show up in adjacent data too: in Ahrefs' August 2025 long-tail study, the overlap between AI-cited URLs and Google's top 10 for the same query was ~29% on Perplexity vs ~7-8% on ChatGPT / Gemini / Copilot. The exact gap varies by which metric you compute, but the underlying point holds: one aggregated cross-engine number hides the structural differences that drive optimization decisions. Each anchor entry's per-engine notes spell out the engine-specific caveats.
How do the six metrics relate to traditional SEO metrics?: Loose analogies, not direct replacements. Attribution rate is closest to impression share but at the AI-response level instead of SERP level. Citation share is the AI-search analog of share-of-voice but built on citation events instead of impression positions. Citation match rate is closest to a 'linked vs unlinked mention' split (which has no exact SEO counterpart). Cite-ability is closest to content-quality scoring but operationalized for AI extraction, not human readability. Citation velocity and citation rotation have no clean SEO counterparts: SERP ranks are tracked per query but not as time-derivatives or temporal-stability measures at the citation level.
Are any of the six defined in vendor or academic literature?: None of the six AI-search operationalizations are defined in current vendor documentation or peer-reviewed academic literature. Citation velocity inherits the metric shape from academic bibliometrics (Garfield 1955 onward) but the AI-engine application is recent and practitioner-coined. The other five are practitioner-coined for the AI search context; several vendor blogs (UltraScout, Rankeo, Steakhouse, Profound, Otterly, Peec) publish related framings, typically with looser operational definitions than this cluster uses. Tools differ on counting rules; cross-tool numbers should not be assumed apples-to-apples.

Sources & further reading

New terms shipped that week, plus one observation from the AI-citation tracker.

More about what you'll get

Last fact-checked 2026-05-27. Spotted an error or stale claim? See editorial methodology.

Changelog (6 entries)

2026-06-18: Added a seventh gap to 'What no single metric captures': cross-engine cited-set agreement. None of the six metrics measures whether two engines cite the same pages as each other for the same prompts, and under a fixed prompt set the engines' cited-source sets often overlap little, so a single blended 'AI visibility' number averages across systems that mostly cite different pages. Links the new engine-disjoint citation dispatch, which measures that overlap directly.
2026-05-28: Expanded the surface-family list from 8 to 12 anchors (added Brave Search, Grok, DuckDuckGo AI, Meta AI shipped today). Added a Cross-surface quick reference section: comparison table across the 12 surfaces on four axes (index provenance, crawler discipline, default citation rendering, structurally unique feature) plus five cross-cutting observations. Surfaces structural differences: DuckDuckGo's 1-2 source slot vs Perplexity's ~10-20 (10x tighter zero-sum competition), Grok's effectively unenforceable robots.txt, Meta's licensing-tiered citation behavior, non-web source categories (Grok X posts, Meta Instagram/Facebook/Threads).
2026-05-27: Same-day revision. Critical: reframed 'velocity = time-derivative of attribution rate' as 'temporal leading indicator' (different units; cluster-ripple to citation-velocity anchor); softened Aggarwal attribution from 'foundational paper formalizing GEO measurement framework' to 'starting point' (paper uses PAWC + Subscription Impression, not attribution rate). Substantive: decision matrix Q5 reworded, 'rewards content effort' softened, 'Two metrics typically anchor' qualified, composite-score claim adds vendor inline examples, velocity-vs-acceleration distinction explicit.
2026-05-27: Same-day self-review fix. Corrected a metric-conflation error in 3 places (lede, attribution-rate section, FAQ #3) that mislabelled Ahrefs' AI-cited-URL-to-Google-top-10 overlap percentages (~29% Perplexity, ~7-8% ChatGPT/Gemini/Copilot) as 'attribution rate' measurements. The Ahrefs footnote was always accurate; the body had drifted to a shorthand that conflated the two metrics. Rewrote those passages to label the percentages correctly as cross-engine overlap data and to use it as illustrative evidence for the 'aggregation hides structure' point rather than as direct attribution-rate numbers.
2026-05-27: Initial publish. Pillar synthesis of the citation-metrics cluster (6 anchor entries: attribution-rate, citation-share, citation-match-rate, cite-ability, citation-velocity, citation-rotation). Organizes the 6 metrics along three axes (output ratios, input content property, temporal signals) and adds a decision matrix, gap analysis, and adoption sequencing playbook. Pillar length (~3000 words) intentionally exceeds the term-entry target.
2026-06-29: Perplexity citation confirmed for the definition query. Perplexity surfaced the AI citation metrics overview inline and in its sources-used list, reproducing the per-metric breakdown. First tracked engine to cite this pillar.