Research · Case study

Citation Laundering: What a German Court Just Confirmed About AI Overviews

We caught an AI engine consuming our verified entity data, stripping the provenance, and re-emitting it as its own confident claim with fabricated sourcing. Three weeks later, a German court described the same mechanism in Google’s AI Overviews — and called it the operator’s own liability.

Published 6 June 2026 · Updated 14 June 2026 12 min readBy Entidex

The short version

We caught an AI engine reproducing our proprietary entity intelligence — AI Visibility, Sentiment, Share of Voice, drift — as confident synthesised knowledge.
It cited none of it back to us. The attributions pointed at domains that have never published any of those metrics.
We call this citation laundering: accurate data, fabricated provenance — and the laundering step is designed to destroy the evidence at source.
On 28 May 2026, the Regional Court of Munich (LG Munich I, 26 O 869/26) ruled on a near-identical mechanism in Google’s AI Overviews — and held the operator directly liable, because synthesising sources into a new statement is the operator’s own speech.
The court named a check that, inside Google, only Google can run. Entidex runs the equivalent from the outside: a dated record of what an engine said, when, on which surface, against a verified record of what is true.

Entidex exists to answer one question precisely: what do AI engines say about an entity, and is it true? To do that we resolve brands, people, products and organisations to canonical, verified records, then measure how each engine perceives them. In the course of that work we caught an AI engine doing something specific and damaging. This is the case study of both halves: the incident we observed, and the ruling that confirmed why it matters.

Part 1

The incident

On 6 June 2026, we ran a simple query through Google’s AI search: “What does AI say about PBD Podcast?”

Google returned a detailed, confident answer. It cited AI Visibility scores, Share of Voice percentages, Sentiment ratings, competitor mappings, and narrative divergence analysis. It described the precise gap between how the press frames the show and how the show’s own content positions itself. It even named the specific AI engines involved — ChatGPT, Perplexity, Grok — and quantified their coverage.

Every observation came from our entity page for PBD Podcast. Google’s AI Overview cited none of them back to us. Instead, the response attributed its claims to general-interest and social domains that contain no entity intelligence data, no visibility scoring, and no cross-surface Sentiment analysis. The citations were anchors bolted onto real data to give the appearance of sourcing.

This is what we call citation laundering. The reproduction had three properties that, taken together, define it:

Provenance stripped.The structured signal we had resolved and verified appeared in the engine’s answer with no attribution back to its actual source.
Restated in the engine’s own voice.The data was not quoted or linked — it was absorbed into a fluent, declarative statement presented as the engine’s own knowledge.
Fabricated citations attached. The engine appended source links that, on inspection, did not contain the claim being made.

What the data shows

The Entidex entity page for PBD Podcast contains live intelligence compiled from 42+ collectors across 8 signal categories. The numbers it presented were not estimates or general knowledge. They were specific, proprietary metrics from a live intelligence pipeline:

100/100

AI Visibility

Computed from probe responses across five AI engines.

71/100

Sentiment

Triangulated from AI engine responses, press coverage, and owned content.

93%

Share of Voice

Measured against a 17-entity podcast cohort.

+0.68 pts/day

Press vs Owned drift

A divergence event detected over a 90-day window.

These are not numbers that exist anywhere else on the internet. They were presented as though they were common knowledge, with citations pointing to domains that have never published them.

The tool designed to detect AI hallucinations about entities was hallucinated about by AI. Except this wasn’t hallucination. The data was accurate. The citations were not.

Why this is worse than a normal error

A wrong link is recoverable — you click it, you see the source, you judge for yourself. A laundered claim removes that escape hatch. The reader has no way to tell that the citation is decorative. The confidence is manufactured at the point of synthesis, and the audit trail that would let anyone check it has already been destroyed.

This is exactly the failure mode Entidex was built to catch, because it is invisible to the entity it harms. A brand does not know an engine is misrepresenting it unless something is continuously comparing what the engine says against a verified record of what is true.

What the structured data shows

We checked our own markup after the incident. The entity page carries:

A schema.org/Dataset block declaring Entidex as the creator, with a DataDownload distribution pointing to our API.
A schema.org/Thing block identifying the entity, with sameAslinking the podcast’s canonical domain.
A schema.org/BreadcrumbList for navigation context.
Full OpenGraph and Twitter Card metadata with entity-specific descriptions.
A canonical URL, proper meta descriptions, and robots: index, follow directives.

The structured data is not merely present — it is comprehensive. It tells any crawler, including Google’s, exactly where this data comes from, who produced it, when it was last updated, and how to access it programmatically. The AI layer ignored all of it. This suggests that Dataset JSON-LD, while useful for traditional search indexing, is not yet being honoured by AI Overview for citation attribution. Discovery and attribution are separate problems, and generative engine optimization has to solve both.

Part 2

The ruling

Regional Court of Munich (LG Munich I) · case 26 O 869/26 · preliminary injunction, 28 May 2026. Builds on the more cautious Frankfurt decision 2-06 O 271/25 (September 2025).

The Regional Court of Munich issued a preliminary injunction barring Google from repeating false statements about two Munich-based publishers, whose names its AI Overviews had wrongly tied to scams, subscription traps and dubious business practices — connections that appeared in none of the cited sources.

The court’s reasoning is the part that matters. It refused to treat AI Overviews like ordinary search results:

Ordinary search links; AI summaries author. Traditional search surfaces indexed third-party content with a title, snippet and link. AI Overviews instead evaluate and combine multiple sources into what the court called “independent, new, and substantive statements.” That makes the output Google’s own speech, not a forwarded third party’s.
The host-provider shield does not apply. German higher-court precedent had given search engines limited liability precisely because they only made third-party content findable. The Munich court found that reasoning breaks the moment the system writes its own answer.
“Just check the sources” is not a defence. Google argued users can dig deeper and verify. The court rejected it: the feature presents a self-contained answer, and the reader is not obliged to cross-examine it.
Only the operator can do the check.The court observed that verifying these statements means comparing the underlying third-party sources against the engine’s own output — and that only Google is positioned to do that internally.

The court classed Google as a direct source of the false content rather than an indirect intermediary, and ordered it to stop the specific statements or face penalties.

The status, stated honestly

This is a German ruling, a preliminary injunction, and not final— Google has said it is reviewing the decision and is expected to appeal. It builds on a more cautious Frankfurt decision from September 2025 (2-06 O 271/25) that accepted AI-overview liability was possible but declined to grant relief on those facts. Munich went the further step and prohibited specific false statements.

It is not, on its own, law in the UK or US. What it is — and why it matters far beyond Germany — is the first clear judicial articulation of a principle every operator of an answer engine now has to reckon with: synthesis is authorship, and authorship carries responsibility.

Part 3

Why the two halves are the same story

Read the court’s description of the mechanism against the incident we documented. They line up point for point.

The court’s finding	What citation laundering does
The engine produces “independent, new, and substantive statements”	Provenance is stripped; the claim is restated as the engine’s own
The output made claims not present in the cited sources	Fabricated citations attached to an unsupported claim
“Check the sources yourself” is no defence	The laundered citation defeats exactly that check
Only the operator can verify by comparing sources to output	The harm is invisible to the entity unless something external is verifying continuously

The court arrived independently, through legal reasoning about one publisher dispute, at the same description of the failure mode Entidex set out to measure. That is the validation. Not that we predicted a court ruling — that the mechanism is real, repeatable, and now legally consequential enough that a court has named it.

Part 4

What this means, and where Entidex sits

For brands, publishers and organisations, the practical takeaway is blunt: an AI engine’s confident misstatement about you is no longer a curiosity. It is a documented category of harm with, in at least one jurisdiction, a remedy path. But a remedy path needs evidence, and the laundering step is designed to destroy the evidence at source.

That is the gap Entidex fills. The court named a check that, inside Google, only Google can run. Entidex runs the equivalent check from the outside:

A canonical, verified record

We hold the ground truth for the entity — resolved from authoritative sources, not inferred from what engines happen to say.

Continuous cross-surface observation

We watch what AI engines and other surfaces actually say, and surface the gap through AI Visibility, Sentiment and Share of Voice — with Trust Stack, Narrative Coverage and Maturity as the derived read on whether an entity’s representation is well-sourced, consistent and defensible.

A dated divergence record

When an engine diverges from the verified record, that divergence is captured with a timestamp — a dated record of what was said, when, on which surface, against what was true.

That dated record is precisely what an entity needs beforeit can act — the standing evidence that survives the laundering step, the artifact you would attach to a correction request or hand to a lawyer. In this case the Munich publishers sent a cease-and-desist and Google did not fix it; the dispute escalated because the harm was demonstrable. Demonstrability is the whole game, and it is what we produce.

It also reinforces why we publish a machine-readable verified record for every entity — an agent- and crawler-readable knowledge statement designed to close the AI knowledge gap at the point of ingestion, rather than hoping attribution survives the synthesis layer.

Go deeper with Entidex

The full entity intelligence platform — beyond Explore Entidex

Continuous multi-source entity observation
Alerts when sources diverge or drift
Cross-surface consensus + visibility over time
Evidence-anchored intelligence reports

See pricing

Frequently asked questions

What is citation laundering?

Citation laundering is the process by which an AI search engine consumes authoritative structured data, repackages it as synthesised knowledge, and attributes it to unrelated sources — or to no source at all. The underlying data can be entirely accurate while the provenance presented to the user is fabricated.

What did the German court rule about AI Overviews?

On 28 May 2026 the Regional Court of Munich (LG Munich I, case 26 O 869/26) issued a preliminary injunction barring Google from repeating false statements its AI Overviews had made about two Munich publishers. The court found that because an AI Overview evaluates and combines sources into an independent, new statement, that statement is Google’s own speech — making Google directly liable, not a protected intermediary. The ruling is German, preliminary, non-final, and expected to be appealed.

Is Google legally liable for false AI Overview claims everywhere now?

No. The Munich ruling is a German preliminary injunction and is not, on its own, law in the UK or US. What it is is the first clear judicial articulation of the principle that synthesis is authorship: any operator whose system rewrites sources into a confident answer is exposed to the same argument.

Does schema.org Dataset structured data guarantee an AI engine will cite the source?

No. In the case documented here, the source page carried a well-formed Dataset JSON-LD block naming the creator, a machine-readable download endpoint, and a last-modified timestamp. The structured data influenced discovery — the page ranked for the query — but the AI Overview layer did not honour it for citation attribution. Discovery and attribution are separate problems.

How is citation laundering different from an AI hallucination?

A hallucination is factually wrong output. Citation laundering can present perfectly accurate data with inaccurate provenance. The numbers are real; the cited sources are not where they came from. It is a provenance failure, not a factual one — which makes it far harder for a reader to detect.

How can a publisher detect citation laundering?

Compare the claims an AI answer makes against the sources it cites. Where a cited domain does not actually contain the claimed data, the attribution is fabricated. Entidex tracks this as a distinct signal: for every probe we run, we capture the URLs an AI engine volunteers and verify whether each source genuinely contains the claim attributed to it.

What is the truth gap?

The truth gap is the measurable distance between what is verifiably true about an entity and what AI engines claim is true. Citation laundering creates a second-order truth gap: even when the facts survive, the provenance is erased, so a reader cannot trace the claim back to its origin.

Methods & sources

The incident was identified through Entidex’s standard entity-observation workflow against a verified canonical record. We do not disclose probe internals, model rotation or scoring methodology; the public methodology covers what each metric measures and how to read it.

Ruling details are drawn from contemporaneous reporting of LG Munich I case 26 O 869/26 (28 May 2026), including coverage by The Decoder, heise online and others, and the prior Frankfurt decision 2-06 O 271/25 (September 2025). Quoted phrasing of the court’s reasoning reflects published translations of the German ruling.

This case study is commentary on a legal development and is not legal advice. The Munich ruling is a non-final German decision under appeal and does not establish law in other jurisdictions.

See what AI engines are saying about your entity — against the record of what’s true. Entidex is an entity intelligence platform: we monitor how AI engines describe, recommend, and rank entities across every public surface, and track the citation chains that connect AI claims to their actual sources.

Run a free scan