Tier1 Workshop
Defensible Benchmarks · verify before you sign up

Agents already work. Here are the receipts.

Three numbers from this year, each with a primary source you can open in a new tab. This is the public version of the credibility tile we show in the workshop, built so a sceptical operator can fact-check it before paying for a seat.

Source posture

Primary links come first, methodology is linked separately, and the as-of date travels with the tile so screenshots stay accountable.

How to read this. Every number above links to its primary source, so open them. The three logos are shown in greyscale on purpose: the numbers carry the slide, not the brands. We refresh these figures monthly, and the "current as of" date advances with each refresh even when the numbers don't change, so the tile never looks stale. Questions about methodology? The arXiv paper is the GAIA method of record.