The CPU comeback: why agents bend AI spend back toward Graviton
The cleanest way to misread this deal is to call it a chip deal. It is — but the chip that matters is a CPU, not a GPU. AWS's Graviton4 instances ship with 192 Arm Neoverse V3 cores fed by 12 channels of memory running up to 8800 MT/s [4], and those are precisely the cores Snowflake needs to scale Cortex AI. The Register's framing is the key: in an agentic workload, the model still calls a GPU for inference, but the tools the model calls — a SQL query, a Python function, a vector lookup — execute on CPU [4].
As enterprises move from one-shot LLM calls to multi-step agents that orchestrate dozens of tool invocations per task, the cost center shifts from inference seconds to orchestration cycles. That is the workload Graviton was built for, and that is why a 'data warehouse' company is suddenly the largest non-LLM Graviton buyer in the market. Andy Jassy has been claiming for months that AWS silicon offers 'better price-performance' than Nvidia's [3]; the Snowflake commitment is the first enterprise data-platform receipt for that claim.




