Playbook
How to Get Cited by AI: The Complete 2026 Playbook
Everything that decides whether ChatGPT, Claude, Perplexity and Google's AI Overview name your brand — the four surfaces, the on-page spec, the schema, the entity and the links — in one place, with the data behind each move.
GetCited · 16 min read · Updated 24 June 2026
If you only read one thing about getting cited by AI, read this. It pulls together everything that decides whether an answer engine names your brand — the four surfaces and how each one chooses, the on-page spec that makes a page liftable, the schema and entity work that make you legible to a machine, and the off-site links that do most of the actual lifting — and tells you the order to do them in.
Every claim here is anchored to data: our own 3,050-answer dataset where we have it, named third-party studies where we don't.
What "getting cited" actually means
A citation is an AI answer that names or links your brand as a source. It is not a ranking and it is not a click — it is being on the shortlist the model hands a buyer at the moment of decision. That is why it matters: the visitor who follows an AI citation arrives pre-sold, and converts several times harder than ordinary organic traffic (Seer measured one site at 15.9% from ChatGPT versus 1.76% from Google organic).
Citation is measured as a frequency, not a yes/no. LLMs are non-deterministic — ask the same question five times and you may be cited twice. So everything below is measured the same way: a fixed battery of real buyer questions, run repeatedly across the surfaces, before and after the work.
The four surfaces — and why they disagree
There is no single "AI". There are four answer engines that retrieve and cite differently, and our dataset shows they barely agree on who to cite. Win one and you have not won the others.
| Surface | How it retrieves | What it rewards | |---|---|---| | ChatGPT Search | Live web search + its own index | Answer-first structure, freshness, clear on-page facts | | Claude (web search) | Live web search, fewer sources per answer | Genuinely useful, well-structured pages; rewards depth | | Perplexity | Aggressive multi-source retrieval, many citations | Breadth of corroborating sources, listicles, fresh pages | | Google AI Overviews | Google index + AI layer | Schema, entities, and signals that already rank in Google |
The practical consequence: two-thirds of AI answers cite a source, but which sources differ sharply by engine and by category. A page tuned only for one engine leaves the other three on the table. The work below is what moves all four at once. (Surface-specific tactics live in the ChatGPT guide and the Perplexity & AI Overviews guide.)
Lever 1 — Write to the citation spec
The single biggest controllable factor is whether the model can lift a clean answer from your page. Most pages can't be lifted: the answer is buried in paragraph three, there's no table, the heading is a creative title instead of the question. Our on-page spec fixes exactly that:
- Answer first. The first 40–60 words after the H1 answer the question directly. Every H2 opens with a self-contained 40–75-word answer — passages that size are cited far more often than long ones. (Full detail in the content chunk spec.)
- Headings mirror the question. "Does Vietnam have a nomad visa?" — not "Visa considerations". The model matches questions to headings.
- 15+ named entities per page — products, places, tools, brands. Entity-dense pages are markedly more cited in AI Overviews.
- One original, dated fact every ~150–200 words — a price, a processing time, a measured result only you have. Originality is what makes you the source rather than a paraphrase of someone else.
- A comparison table on every "best X / X vs Y" page, and a 3+ question FAQ on every guide.
- Visible freshness — a real Last updated date. Recently-refreshed content is lifted significantly more on ChatGPT.
This is the fastest, cheapest lever because it usually means making existing pages legible, not writing net-new.
Lever 2 — Implement the schema
Schema is the machine-readable layer that tells an AI what your content is: that your ranking page is a ranking, your reviews are reviews, your FAQ is question-and-answer. Without it, a human sees your FAQ and the model sees undifferentiated text.
| Schema | Where | What it unlocks |
|---|---|---|
| Organization + sameAs | Site-wide | Identity + links to your profiles — the entity signal |
| Article + Person | Guides | Author and dates — the authorship and freshness the AI reads |
| FAQPage | Pages with 3+ Q&A | The exact Q&A the model lifts verbatim |
| ItemList + Review | Ranking / comparison pages | Your scored list read as a ranking |
| BreadcrumbList | Site-wide | Structure the crawler follows |
Add an /llms.txt at your root pointing AI crawlers at your best content, and confirm your robots.txt actually lets the AI bots in (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) — a surprising number of sites lock them out by accident.
Lever 3 — Build the entity
Models treat a brand they recognise as a thing, and a brand they don't as a string of letters. An entity build — a Wikidata item with sameAs wiring to your real profiles — makes "your brand" resolve to a known publisher, which lifts brand-mention rate and trust across every surface. This is the work in Wikidata for brands. It compounds over weeks, not days.
Lever 4 — Earn the off-site citations
Here is the uncomfortable truth: roughly 90% of AI citations point at third-party pages, not your own site. The model mostly cites the sources that other people trust — listicles, review platforms, community threads, industry media.
That makes off-site the heaviest lever, and it has three parts:
- Listicles. Being named on the "best X" lists the AI already cites. Listicles take a disproportionate share of citations on commercial questions — why listicles win covers the mechanism, and listicles by platform covers how each engine treats them.
- Review platforms. G2, Capterra, Trustpilot and category equivalents are citation gates for a lot of commercial answers — see review platforms and AI.
- Community and reference sources. Reddit and Wikipedia are cited far out of proportion to their share of the web — the Reddit & Wikipedia citation levers.
Off-site is the slowest lever because it depends on other people publishing — but it's the one that moves the needle furthest.
The order to do it in
Sequence matters. Do the controllable, fast work first so you see movement, then the compounding work:
- Baseline. Lock a 20-question battery, scan all four surfaces. You can't prove movement without a "before".
- On-page + schema + llms.txt (Lever 1 + 2). Fast, fully in your control. First new citations typically show on ChatGPT and AI Overviews within ~14 days.
- Entity + off-site (Lever 3 + 4). The heaviest levers; they compound across 30–90 days.
- Measure. Re-run the identical battery at day 14 and day 30. The proof is the same numbers as the baseline, moved.
Measure it, or it didn't happen
The whole point of AI citation is that it is measurable. Wire GA4's "AI Assistant" channel (added 13 May 2026) plus a custom channel group with a referrer regex for Perplexity and Copilot, track your real conversion event, and read AI-channel conversion versus the rest of your traffic monthly. Be honest about the limit: AI Overviews and some answers pass no referrer, so they land as "direct" — measure the trackable channel and never claim a conversion you can't attribute.
Frequently asked questions
How long until I get cited? On-page and schema work shows first movement in about 14 days on ChatGPT and AI Overviews. Entity and off-site work compounds across 30–90 days. Anyone promising overnight results is guessing.
Is AI citation the same as SEO? Overlapping, not identical. AEO/GEO optimises for being the cited answer rather than the ranked link. The schema and quality signals overlap with SEO; the answer-first structure and the off-site citation surfaces are specific to AI. See what is AEO.
Do I need to write new content? Usually not at first. The fastest wins come from making content you already have legible — answer-first structure, schema, freshness — not from writing more.
Which surface should I optimise for? All four, because they disagree. The levers above move all of them; surface-specific tactics are the last 10%.
Can I do this myself? The on-page and schema work, yes. The entity and off-site placement are where most teams stall — they take ongoing outreach. That's the part GetCited does for you, measured, with a refund if the threshold isn't met.
Sources
- GetCited — primary dataset: 3,050 AI answers / 11,703 deduped citations across ChatGPT, Claude and Google AI Overviews, June 2026 (2026)
- ALM Corp / Omniscient — 23,000-citation analysis (listicle citation share) (2025)
- Semrush — 150k-citation LLM reference study (Reddit, Wikipedia share) (2025)
- Seer Interactive — AI vs organic conversion-rate measurement (2025)
Related guides
What AI Actually Cites: 3,050 Answers Analysed
We ran the same commercial questions through ChatGPT, Claude and Google AI Overviews and counted every source they cited. The engines don't agree — and what's citable depends on your category.
How to Get Cited by ChatGPT in 14 Days
A step-by-step playbook for earning citations in ChatGPT Search — the prompt battery, the on-page spec, the entity layer, and the one lever that moves the needle most.
How to Get Cited by Perplexity and in Google AI Overviews
Perplexity and Google AI Overviews retrieve differently from ChatGPT — here are the extra moves that earn citations on each, on top of the shared fundamentals.
The Chunk Spec: How to Structure Content So AI Cites It
AI answer engines lift the one passage that answers the question. Structure your page as self-contained, liftable chunks — and a before-and-after that proves it — to get cited far more often.
Want this done for you — and proven?
GetCited measures whether ChatGPT, Perplexity, Google AI Overviews and Claude cite your brand, then does the work to move it — with the dated transcripts behind every number.