The reversal: AWS stops cutting and starts charging

For most of its history, AWS was a metronome of price cuts - and Capacity Blocks for ML were no exception, with three reductions across 2024 and 2025 [4]. 2026 broke the pattern. A quiet ~15% increase landed in January, reported by both The Register and DatacenterDynamics on the H200 instances [6][7], and now a third consecutive hike of about 20% takes effect July 1 [2]. When the largest cloud provider raises rather than cuts prices on its scarcest product, it is a market signal: demand growth has outpaced capacity expansion, and the era of compute getting reliably cheaper has, at least for frontier GPUs, paused [8]. The cumulative effect is steep - across chip configurations, AWS GPU reservation pricing has climbed roughly 20% to 50% over the course of 2026 from the stacked increases [8]. This is not a discount cycle blip; it is a structural reset in how guaranteed AI compute is priced.


