GEO Glossary

/terms/gemini-citation · 4 min read · intermediate

Gemini citation

Gemini citations are the source attributions Google's Gemini chatbot and API produce when answering questions with grounded real-time web content. Citations appear inline in the Gemini chatbot app (gemini.google.com and mobile) and as structured groundingMetadata fields (with groundingChunks containing uri and title) in the Gemini API response. Distinct from AI Overview and AI Mode, which are Google Search surfaces, not Gemini app surfaces.

Citation status

ChatGPTPerplexityClaudeCopilotGemini

Last checked 2026-05-26

Gemini citations are the source attributions Google's Gemini chatbot and API produce when responses are grounded on real-time web content. In the consumer chatbot (gemini.google.com web app and the mobile apps on Android and iOS), citations appear inline as source links or chips attached to the response, with a sources panel listing referenced URLs. In the Gemini API, citations are returned as a structured groundingMetadata object containing groundingChunks (each with uri and title), groundingSupports (linking text segments to source indices), and webSearchQueries1.

Gemini citations are distinct from AI Overview citation and AI Mode, which are Google Search surfaces (the AI panel above SERP results, and the conversational tab inside Google Search, respectively). All three are Google products, but they are separate surfaces with different UI, reporting, API billing, and citation rendering; Google has not published whether their source-selection logic is identical. Optimizing for one does not automatically optimize for the others. Gemini is also a distinct measurement target from Perplexity citation, Claude citation, Microsoft Copilot citations, and AI dev tool citations; same-domain citation rates vary across engines.

Status in 2026

The Gemini product line was renamed from Bard to Gemini on 2024-02-08, with the iOS and Android Gemini mobile apps launching the same day and the Gemini Advanced subscription introduced via Google One AI Premium2. Google Search grounding (the API feature that gives Gemini real-time web search citations) first shipped in the Gemini 2.0 Flash Experimental release in December 2024 and is supported across subsequent Gemini model families; Google's API documentation maintains the active list of grounding-capable models.

Four product surfaces matter for citation tracking:

  • Gemini web app (gemini.google.com): the primary consumer surface. Citations render as inline source links or chips attached to relevant response segments, with an expandable sources panel.
  • Gemini mobile apps (Android, iOS): launched 2024-02-08 alongside the rebrand. Compact card view for sources; appears to follow broadly similar retrieval and citation conventions as the web app, though product-specific behavior should be tested separately.
  • Gemini API: developers integrate grounding via the google_search tool. Response structure is documented:
    • groundingMetadata.webSearchQueries: the queries the model executed
    • groundingMetadata.groundingChunks: web sources, each with uri and title
    • groundingMetadata.groundingSupports: links text segments (via startIndex / endIndex) to source indices through groundingChunkIndices
    • groundingMetadata.searchEntryPoint: HTML / CSS for a Search Suggestions widget the developer can render Older Gemini 1.0 / 1.5 models require the deprecated google_search_retrieval tool. Pricing differs by model family: Gemini 3 series bills per search query the model decides to execute; Gemini 2.5 and older bill per prompt regardless of search count.
  • Vertex AI deployment: enterprise developers access Gemini grounding through Google Cloud's Vertex AI surface, which is the primary enterprise GCP deployment for Gemini API access. Grounding response shape tracks the direct API; per-account billing flows through GCP rather than the direct Gemini API key.
  • Deep Research mode (inside the Gemini consumer app, debuted alongside Gemini 2.0 Flash in December 2024): when enabled, Gemini executes many web searches in a multi-step research workflow and produces a long-form report with citations to all sources used. This is the same citation infrastructure as the consumer chatbot, but typically with many more cited sources per response (tens to hundreds, not single digits).

Detection methodology by surface: referrer-based detection works differently across the four surfaces.

Surface Referrer-based detection
Gemini web app (gemini.google.com) ✅ Works; citation clicks send referrer host gemini.google.com
Gemini mobile app (Android / iOS) ⚠️ Varies; depends on whether clicks open in an in-app webview or external browser
Gemini API (downstream apps) ❌ Server-to-server; downstream apps using google_search produce no Gemini-attributable referrer in publisher logs
Vertex AI ❌ Server-to-server; same as direct API

Server-log filtering captures only the gemini.google.com web subset reliably; the other surfaces require active probing or downstream-app cooperation.

Google's documentation describes grounding's purpose verbatim as helping to "reduce model hallucinations by basing responses on real-world information"3, with the model autonomously deciding when a search would improve the response.

How to apply

Three layers of practitioner action for inclusion in Gemini's grounding pool:

  • Have strong Google Search indexability: Gemini's grounding feature is Google Search grounding, so strong Google Search indexability is the most reasonable starting assumption (sitemap submission via Google Search Console, robots.txt allowing Googlebot, clean canonicals, no JavaScript-only content). Google has not documented the exact gating path from Googlebot crawl to Gemini citation, so this is a defensible inference rather than a vendor-confirmed prerequisite. Separately, Google-Extended (documented in AI crawler bots) is the robots.txt control token Google documents as a training and use-control signal for Gemini family models, not as a grounding-time retrieval control; allowing Googlebot but disallowing Google-Extended is a defensible configuration for sites that want Google Search indexing but not training-set inclusion.
  • Optimize for citation-friendly content shape: clean answer-block structure, source-supported claims, and primary-source attribution make a page easier to retrieve, summarize, and cite. Practitioners generally optimize for Gemini-targeted reach with the same cite-ability disciplines that apply across Perplexity, Claude, and Microsoft Copilot. Whether Gemini's retrieval explicitly weights these features over alternative content shapes is not vendor-published; the practitioner discipline applies regardless.
  • Track Gemini citations separately from AI Overview and AI Mode: aggregating the three Google surfaces into one "Google AI citation rate" hides per-surface signal. AI Overview optimization tactics, AI Mode behavior, and Gemini app behavior diverge measurably. The attribution rate entry covers the measurement decomposition principle.

What to skip:

  • Treating Gemini citation as the same product surface as AI Overview or AI Mode. They are separate measurement targets within the Google ecosystem.
  • Paying for Gemini-specific tracker tools in month 1. No publisher-side dashboard exists from Google for the Gemini app; tracking is via active probes plus standard Google Search Console for organic-search context.
  • Optimizing for Gemini API tool-use coverage. Server-to-server API integrations produce no publisher-visible referrer; the measurement gap is structural and cannot be closed at the content layer.

What remains contested or unverified

  • The exact composition of Gemini's real-time grounding backend. Google has not vendor-documented whether the google_search tool uses the public Google Search index, a Gemini-specific subset, or a hybrid with separate signals. The practitioner inference (Google Search index is the primary source) is plausible but not vendor-confirmed.
  • Whether allowing Googlebot is necessary for inclusion in Gemini's grounding pool. The grounding mechanism is not separately disclosed, so the implication that Google Search indexing is the gating step is inferred rather than confirmed.
  • Whether disallowing Google-Extended in robots.txt affects only training-data ingestion or also indirectly affects grounding-time retrieval. Google's documentation positions Google-Extended as a training-data control; whether it has secondary effects on the Gemini grounding pipeline has not been clarified.
  • How citation rendering differs across Gemini surfaces for the same query. The consumer chatbot, Gemini Advanced subscription tier behavior, and API responses can present different source pools and citation styles; the per-surface differential has not been systematically published.

How it relates to other concepts

  • Parallel surface category to Perplexity citation, Claude citation, Microsoft Copilot citations, and AI dev tool citations: each is a distinct citation-surface family with its own measurement story.
  • Distinct from AI Overview citation and AI Mode within the Google ecosystem: AI Overview is the AI panel in Google Search SERPs, AI Mode is the conversational tab inside Google Search, and Gemini citation is from Google's standalone Gemini chatbot and API. Three separate measurement targets; conflating them loses signal.
  • Crawl-side dependency on AI crawler bots: Googlebot governs Google Search indexability; Google-Extended is the robots.txt control token for opting out of Gemini training-data ingestion. Neither token directly controls Gemini's real-time grounding pool at query time, which is inferred to draw from Google Search.
  • Measurement input to attribution rate and citation-share frames: Gemini is one of the five engines those frames track, with per-query citation behavior that does not generalize from the other Google surfaces (AI Overview, AI Mode).

Footnotes

  1. Google AI for Developers, "Grounding with Google Search" documentation, ai.google.dev/gemini-api/docs/google-search. The google_search tool gives Gemini access to real-time web content. Documented response schema: groundingMetadata object containing webSearchQueries (array of executed queries), searchEntryPoint (HTML / CSS for Search Suggestions widget), groundingChunks (web sources with uri and title), and groundingSupports (linking text segments via startIndex / endIndex to source indices through groundingChunkIndices). Supported on Gemini 3.5 Flash, Gemini 3 Pro / Flash, Gemini 2.5 Pro / Flash variants, and Gemini 2.0 Flash; older models require the deprecated google_search_retrieval tool. Pricing: Gemini 3 series bill per search query the model executes; Gemini 2.5 and older bill per prompt.

  2. Per Wikipedia's "Gemini (chatbot)" article (referenced for product-timeline background only; primary product behavior is verified against Google AI for Developers documentation). Bard launched 2023-03-21 in early-access waitlist; expanded to 180 countries May 2023 (upgraded to PaLM 2); the Gemini language model was announced 2023-12-06; on 2024-02-08 Bard was officially renamed to Gemini, the iOS and Android Gemini mobile apps launched, and the "Gemini Advanced with Ultra 1.0" subscription was introduced via Google One AI Premium. Deep Research feature debuted with the Gemini 2.0 Flash launch in December 2024. The "AI Overview" and "AI Mode" Google Search surfaces are separate products; this entry covers only the Gemini chatbot and API.

  3. Verbatim quote source: ai.google.dev/gemini-api/docs/google-search. Google's Gemini API documentation describes the Google Search grounding feature as helping to "reduce model hallucinations by basing responses on real-world information" and as enabling access to recent events beyond the model's training cutoff. The model autonomously decides whether to search based on the prompt; the grounded response returns with structured citation metadata as described in the main entry.

Mentioned in· auto-generated from other terms' related lists

FAQ

How is Gemini citation different from AI Overview or AI Mode?
All three are Google products but they live on different surfaces. Gemini citations come from Google's standalone Gemini chatbot at gemini.google.com (and the Gemini mobile apps and Gemini API) when the model uses Google Search grounding. AI Overview citations come from the AI panel that appears at the top of Google Search results. AI Mode citations come from the conversational AI tab inside Google Search. The three surfaces use related but separately developed retrieval pipelines, are billed differently in the API, and exhibit different citation rendering. Optimizing for one does not automatically optimize for the others; treat them as three distinct measurement targets within the Google ecosystem.
How does Gemini show citations?
In the consumer Gemini chatbot (web app at gemini.google.com and mobile apps), Gemini renders citations as small inline source links or chips attached to the relevant portion of the response, plus a 'sources' panel listing referenced URLs. In the Gemini API, when the google_search tool is enabled, the response includes a groundingMetadata object with groundingChunks (each containing uri and title fields), groundingSupports (linking response text segments to source indices via startIndex / endIndex and groundingChunkIndices), webSearchQueries (the queries the model executed), and a searchEntryPoint (HTML / CSS for a Search Suggestions widget the developer can render).
Which Gemini models support Google Search grounding?
Per Google's API documentation, current Gemini models supporting the google_search tool include Gemini 3.5 Flash (released May 2026), Gemini 3 Pro and 3 Flash (November and December 2025), Gemini 2.5 Pro and 2.5 Flash variants (March-June 2025), and Gemini 2.0 Flash (January 2025; grounding first appeared in the Gemini 2.0 Flash Experimental release in December 2024). Older Gemini 1.0 and 1.5 models require the deprecated google_search_retrieval tool. For Gemini 3 series models, billing occurs per search query the model decides to execute; Gemini 2.5 and older bill per prompt regardless of search count.
Does allowing Googlebot in robots.txt cover Gemini's web search?
Likely, but Google has not vendor-confirmed the exact retrieval backend for the Gemini app and Gemini API google_search tool. The google_search tool uses Google's web search infrastructure, so allowing Googlebot is a reasonable practitioner assumption. Separately, Google-Extended is the robots.txt control token Google publishes for opting a site out of training data ingestion for Gemini family models without affecting Google Search indexing. The distinction matters: Googlebot allow controls Google Search indexability; Google-Extended controls Gemini training-data inclusion; neither directly controls Gemini's real-time grounding pool at query time, which is inferred to use Google Search but is not vendor-documented as such.

Sources & further reading

Get the weekly digest

New terms shipped that week, plus one observation from the AI-citation tracker.

More about what you'll get