The Inversion: When Compute Costs More Than the People Using It
For four decades the implicit contract of software was that compute was cheap and labor was expensive — that's why the entire SaaS playbook (per-seat pricing, automation as substitution for headcount) ever worked. Agentic AI has broken that contract in less than a year. Nvidia VP Bryan Catanzaro put it on record in Fortune that 'the cost of compute is far beyond the costs of the employees' for his own team [1], and the numbers around him support the claim: Microsoft cancelled most of its Claude Code licenses inside six months because direct usage costs exceeded what those licenses replaced [1], and a single Anthropic client logged $500M of Claude spend in a single month after failing to put per-seat caps on employee usage [2].
The structural problem is that an agent is not a tool that a human operates once per task — it is a process that loops, calls other tools, retries, and re-reads context. Goldman Sachs has projected that agentic workloads will drive a 24x growth in token consumption by 2030, hitting roughly 120 quadrillion tokens per month [1]. That's not a curve that the standard 'inference will get cheaper' rebuttal can outrun, because every efficiency gain in price-per-token is being eaten by an even larger expansion in tokens-per-task. The 'reality check' headline isn't that AI is bad; it's that the unit economics of agentic AI do not yet pencil out against the unit economics of paying a human, and that gap is widening, not closing.



