The triple decoupling: Microsoft is reducing dependence on OpenAI, x86, and the cloud — in one keynote
Read the two-day Build/Computex announcement as a single move and it becomes a decoupling, not a product launch. On the model layer, Microsoft's AI Superintelligence Team shipped MAI-Thinking-1, a 35B-active / ~1T-total sparse MoE reasoning model trained on commercially licensed data and explicitly without distillation from OpenAI's GPT series [1]. On the silicon layer, NVIDIA RTX Spark — 20 Grace cores, a Blackwell GPU with 6,144 CUDA cores, 128GB unified LPDDR5X — replaces Qualcomm, Intel, and AMD as the headline chip for the next wave of Copilot+ class Windows PCs [2]. On the deployment layer, DGX Station for Windows pulls a 20-petaflop, 748GB-coherent, trillion-parameter workstation onto the user's desk, so frontier-scale inference no longer has to live in Azure [3]. Pavan Davuluri framed the through-line cleanly, describing the move as "scaling the full power of Windows from thin-and-light PCs to data-center-class workstations" [3]. The PR talks like a partnership; the architecture talks like Microsoft de-risking three vendor relationships at once.



