The Bottleneck Just Moved: From Chips to Megawatts
For two years, the AI scaling debate has revolved around GPU supply, fab capacity, and HBM yields. That frame is now outdated. The binding constraint on frontier AI has migrated upstream — to the substations, transformers, and gas turbines that feed the racks. Dario Amodei laid out the physics in 'Machines of Loving Grace,' arguing that computation has a hard floor: a 'certain minimum energy per bit erased, limiting the density of computation in the world' [1]. That is not a market constraint that capex can dissolve; it is a thermodynamic ceiling that propagates straight into grid planning.
The sell-side has now caught up. Goldman Sachs warned this month that 'the infrastructure foundation on which AI has been constructed will not sustain the AI of tomorrow' [2], and Ford CEO Jim Farley framed it in operator terms: 'Even if the data centers get built, there's still a huge question mark about how the energy sector will support them' [2]. Anthropic itself is the loudest demand signal — Amodei revealed Q1 2026 revenue and usage grew 80-fold on an annualized basis, an expansion the company openly admits its compute footprint cannot absorb [3]. When the lab building the model and the bank financing the rollout both say the limit has shifted from silicon to electrons, the story changes.
BloombergNEF captured the moment by raising its U.S. data-center power forecast 36% in just seven months, to 106 GW by 2035, and naming 'an inflection moment for US grids: the desire to accommodate AI-driven load without undermining reliability or driving up power costs' [4]. The thesis flip is the news. Everything downstream — Stratos, nuclear PPAs, on-site turbines — is a consequence.



