Case study
Building a citation magnet: 522 EU regulator warnings, one original-data study
How we turned seven public regulator registers into a single structured dataset engineered to be the source AI answers cite — the build, the integrity rails, and what we measure next.
GetCited · 6 min read · Updated 24 June 2026
Most "AI visibility" advice stops at "publish helpful content." That's not a lever — it's a hope. The thing answer engines actually reach for, again and again, is original data nobody else has: a number, a list, a table that only exists on one page. So the most reliable way to get cited is to become that page.
This is a worked example of doing exactly that — on one of our own properties, Spreadwise, a forex/CFD comparison site operating in a market (the EU) where regulation banned the usual marketing playbook. No bonuses, no paid ads, no inducements. The only way in is to be cited.
The gap
The incumbents — BrokerChooser, ForexBrokers, CompareForexBrokers — own the "best broker" prompts. What almost none of them publish is a rigorous analysis of the official warning lists: the public registers where EU and UK-adjacent regulators flag firms not authorised to solicit their residents. That's a genuine citation gap — a question AI engines get asked ("which forex brokers are flagged by regulators?") with no single authoritative source to cite.
So we built that source.
What we built
An original-data study aggregating the official, published registers into one structured dataset:
- 522 forex/CFD warning-list entries, verified against four official EU and UK-adjacent regulator registers (the AMF's open-data blacklist alone contributed 483 forex-categorised entries; the rest from CONSOB, BaFin, CySEC, FCA and the IOSCO cross-regulator database).
- Every row carries its source regulator, jurisdiction, date listed, category, and a link to the regulator's own published record.
- A downloadable CSV of all 522 rows, so the data is independently checkable — and quotable.
The page leads with the single headline finding in the first 40–75 words (the chunk an answer engine lifts), then ranked tables by regulator and by year, a reproducible methodology, and a downloadable dataset.
The integrity rails (this is the whole game on a YMYL topic)
Warning-list data is legally sensitive — get it wrong and you've defamed a firm. So the build ran on hard rails, the same ones every GetCited asset uses:
- No fabricated or estimated numbers. Only counts the registers actually contain. Every figure adversarially fact-checked before publish.
- Every source URL live-verified and record-verified — a 200 response isn't the right record.
- Verbatim regulator framing only. A warning-list entry means a firm is unauthorised to solicit in that jurisdiction — not proven fraud. We report each regulator's own published entry and link it; we never add a "scam" label of our own. The list is a point-in-time snapshot; firms can be cleared.
- Honest byline. An organisational editorial byline, not an invented expert.
Trustworthy structure isn't just ethics here — it's the citation lever. Engines cite sources that are specific, sourced, and neutral. The rails are the optimisation.
Why this is built to be cited
Three properties make a page the one an engine reaches for, and this asset has all three:
- Original data — 522 verified rows that exist nowhere else in one place.
- Answer-first structure — the headline finding is the first thing on the page, in liftable form.
- Reproducible sourcing — a methodology and a downloadable dataset, so the claim survives scrutiny.
What we measure next
We don't claim a citation outcome we haven't measured — that would break the one rule the whole product rests on. The honest status: the asset is built and verified; the next step is the GetCited measurement loop — a baseline across ChatGPT, Perplexity, Google AI Overviews and Claude on the target prompts, a distribution pass to seed the asset where engines already look, then a re-scan to read the movement. When that endline lands, the numbers — every one of them on a dated transcript — go here.
That's the method: build the thing that deserves the citation, then prove the citation, with receipts.
Sources
- AMF — liste noire (unauthorised forex/CFD sites), French open-data register (2026)
- FCA Warning List of unauthorised firms (UK) (2026)
- CONSOB — blocked-site orders (Italy) (2026)
- BaFin — unauthorised-business warnings (Germany) (2026)
Want this done for you — and proven?
GetCited measures whether ChatGPT, Perplexity, Google AI Overviews and Claude cite your brand, then does the work to move it — with the dated transcripts behind every number.