Entidex · Signal Layer
We probe Claude, ChatGPT and Gemini at regular intervals and grade their answers against a known ground truth. The result is a live picture of LLM awareness lag — per model, per topic, with retrieval on or off.
Tracking awareness scores for ARM Holdings over recent probes.
How well each LLM knows ARM Holdings — by topic and retrieval mode.
| Model | Retrieval OFF | Retrieval ON |
|---|---|---|
| Claude | 85 | 76 |
| ChatGPT | 82 | 82 |
| Gemini | 85 | 85 |
| Model | Retrieval OFF | Retrieval ON |
|---|---|---|
| Claude | 72 | 79 |
| ChatGPT | 75 | 85 |
| Gemini | 72 | 85 |
| Model | Retrieval OFF | Retrieval ON |
|---|---|---|
| Claude | 72 | 79 |
| ChatGPT | 82 | 79 |
| Gemini | 82 | 85 |