NVIDIA RTX Spark and agentic AI hardware stack unveiled at Computex 2026
TECH

NVIDIA RTX Spark and agentic AI hardware stack unveiled at Computex 2026

55+
Signals

Strategic Overview

  • 01.
    NVIDIA unveiled the RTX Spark superchip at Computex 2026 / GTC Taipei, pairing a 20-core Grace ARM CPU with a Blackwell RTX GPU (6,144 CUDA cores, FP4 Tensor Cores) over NVLink-C2C, delivering up to 1 petaflop of AI compute and 128GB of unified memory.
  • 02.
    Over 30 RTX Spark laptops and 10 desktops ship fall 2026 from ASUS, Dell, HP, Lenovo, Microsoft Surface, and MSI, with chassis as thin as 14mm and as light as 3 pounds; Acer and GIGABYTE follow.
  • 03.
    Alongside the laptop chip, NVIDIA announced Vera, an 88-Olympus-core CPU 'built for AI agents' claiming 1.8x x86 task throughput, plus the NemoClaw / OpenShell agent runtime and an expanded Microsoft partnership unifying Windows, Foundry, and Azure into one agentic stack.
  • 04.
    DGX Spark, the deskside personal AI supercomputer that shares the same Grace Blackwell silicon, launched at $3,999 and has since been hiked 18% to $4,699 Founders Edition MSRP because of global memory supply constraints - not the $2,999 figure widely repeated in initial coverage.

Deep Analysis

Why Huang says the 40-year PC era is over

NVIDIA's framing of RTX Spark is not that it is a faster laptop chip - it is that the laptop chip is now the wrong abstraction. The superchip welds a 20-core Grace ARM CPU to a Blackwell RTX GPU over NVLink-C2C, exposes 128GB of unified memory across both, and reaches 1 petaflop of FP4 compute (with sparsity) in a 14mm chassis [1]. That topology is closer to a miniature data-center node than to a traditional Wintel laptop, and it lets RTX Spark systems run 120B-parameter LLMs with up to 1M token context, edit 12K 4:2:2 video, and still play AAA titles at 1440p above 100fps [1]. Jensen Huang's keynote line - 'For forty years, you launched apps. Click. Type. With RTX Spark and Microsoft Windows, you ask - and the PC does the work' - is the marketing veneer over a hardware argument: agentic workloads need the CPU and GPU sharing one memory pool because the model, the planner, and the tools all read and write the same context [1]. The Register notes that NVIDIA explicitly recast the same GB10 silicon used in DGX Spark as a high-end PC play, with MediaTek co-designing the ARM half - giving NVIDIA its first credible Windows-on-Arm processor and aiming directly at Intel, AMD, and Qualcomm's Snapdragon X [2]. Tom's Hardware reads it the same way, calling Spark a platform that 'promises to turn Windows into an agentic AI OS' [3]. Whether or not the era is actually over, NVIDIA has committed the silicon, the OEM roster, and the OS partner to acting as if it is.

The bandwidth problem r/LocalLLaMA will not shut up about

The strongest counter-narrative is not coming from competitors - it is coming from the people who would actually run local agents. NVIDIA's marketing leans on 128GB of unified memory and 1 petaflop FP4 as the headline numbers, but the LocalLLaMA community has zeroed in on a spec NVIDIA buries: RTX Spark's memory bandwidth is, by those threads' read, far below the 1.2TB/s Vera CPUs get from LPDDR5X and the 1.8TB/s NVLink-C2C link Vera shares with its GPU partner [4]. The technical reading on Reddit is blunt: the chip lands in a dead zone where it cannot push enough tokens per second to run inference comfortably on the giant models the 128GB pool advertises, nor saturate the GPU for top-tier gaming - users propose alternative builds - high-bandwidth x86 workstation memory at similar prices - and the verdict is that NVIDIA's pool is large enough to fit the model but not fast enough to serve it interactively. The same community frames the architecture as deliberately gimped to protect NVIDIA's data-center revenue: give consumers enough RAM to fit a 120B model but not enough bandwidth to actually serve it interactively. AppleInsider's hands-on benchmarks reinforce the skepticism from a different angle, finding the N1X CPU's single-core Geekbench score of 3,096 roughly equal to Apple's M3 Max from 2023 [5]. None of this kills the product - the OEM lineup and Windows integration alone will move units - but it explains why technical Reddit threads with thousands of upvotes are tonally hostile while the X.com cycle is euphoric: people who actually intend to run local LLMs are doing the bandwidth math, and the bandwidth math is not flattering.

DGX Spark's 18% price hike is the memory-supply tell

DGX Spark, the deskside personal AI supercomputer that shares Grace Blackwell silicon with RTX Spark, launched at $3,999 - already $1,000 above the $2,999 figure circulating in pre-launch coverage [6]. Within weeks NVIDIA raised the Founders Edition MSRP to $4,699, an 18% jump of $700, citing global memory supply constraints [7]. That detail matters because the entire agentic-PC pitch rests on cramming 128GB of high-density unified memory into a thin laptop, and the same supply squeeze that is repricing DGX Spark will hit RTX Spark OEMs at exactly the moment they need volume. Leaked OEM pricing already reflects the pressure: N1X-tier RTX Spark laptops are tipped to start around $2,899 with an entry-level N1 tier at ~$1,799 [8]. Constellation's Larry Dignan argues the real DGX Spark thesis was never the developer desktop - 'The real impact of DGX Spark will come from enterprise deployments at the edge... DGX Spark's impact is likely to be seen in physical AI' [6]. Read alongside the price hike, that is a different story than the keynote tells: the silicon that NVIDIA is positioning as personal AI is, on the supply curve, behaving like a constrained data-center component.

Token economics: why Huang's accounting reframe matters

Underneath the laptop story is a balance-sheet story. SiliconAngle's coverage of the GTC Taipei keynote highlights Huang's framing that 'Tokens are now profitable units of revenues... every token is profitable, every token is revenues' [9]. That sentence is the strategic glue between RTX Spark on the desk, DGX Station with up to 748GB coherent memory and 20 petaflops FP4 on the deskside, and Vera plus Vera Rubin GPUs in the cloud [10]. NVIDIA is no longer selling parallel-compute parts; it is selling a token-production pipeline, with the same Grace Blackwell architecture appearing at every tier so a model trained on Azure or run via OpenShell agents on a Surface Laptop Ultra speaks the same dialect end-to-end [10]. ZK Research's Zeus Kerravala reads the keynote as the moment AI moves from research curiosity to production economics, with infrastructure choices made now setting competitive position in an AI-saturated market [9]. The unveils that look like separate products - RTX Spark, Vera, NemoClaw / OpenShell, the expanded Microsoft partnership - are better read as one bet: that the unit of value in computing is shifting from app-hours to tokens-served, and that whoever owns the cheapest, most coherent path from prompt to token wins the next decade [11]. The Reddit skepticism on bandwidth and the AppleInsider skepticism on CPU pedigree are not wrong; they are simply arguing about a different axis than NVIDIA is.

Historical Context

2016
Launched the original DGX-1 AI supercomputer; the 2026 DGX Spark exceeds that machine's performance envelope in a deskside footprint and at a fraction of the cost.
2025-2026
Apple's M-series (now M5) and Qualcomm's Snapdragon X established premium ARM AI PCs as the new norm, defining the market RTX Spark is now muscling into.
2026-03
Announced NemoClaw at GTC, seeding the OpenShell agent runtime months ahead of the Computex hardware unveil so software was ready when silicon shipped.
2026-05
Received the first Vera CPU deliveries - NVIDIA's debut as a standalone CPU vendor selling silicon directly to frontier AI labs.
2026-05-31
Jensen Huang's Computex 2026 / GTC Taipei keynote unveiled the RTX Spark superchip, the Vera CPU, the multi-generation Spark roadmap, and an end-to-end agentic AI stack.
2026-06-02
Build 2026 keynote jointly detailed the unified agentic stack from Windows AI PCs through DGX Station to Foundry and Azure, including a GB300-based DGX Station for Windows with 748GB coherent memory.

Power Map

Key Players
Subject

NVIDIA RTX Spark and agentic AI hardware stack unveiled at Computex 2026

NV

NVIDIA

Designer of RTX Spark, Vera, DGX Spark, and the NemoClaw/OpenShell agent runtime; repositioning itself from GPU vendor to full-stack 'AI infrastructure company' that owns device, deskside, and data-center silicon.

MI

Microsoft

Co-architect of the agentic Windows stack: integrates RTX Spark into the Surface Laptop Ultra, embeds OpenShell into GitHub Copilot, deploys Vera Rubin and Anthropic Claude on Azure, and unifies Foundry plus Windows into one agentic platform.

ME

MediaTek

Co-designed the ARM CPU half of the Grace Blackwell RTX Spark superchip, giving NVIDIA its first credible Windows-on-Arm processor path.

AS

ASUS, Dell, HP, Lenovo, MSI, Microsoft Surface (+ Acer, GIGABYTE)

Launch OEMs shipping RTX Spark laptops and desktops in fall 2026; ASUS unveiled ProArt P16, P14, and Mini PC Spark systems while the Surface Laptop Ultra is positioned as Microsoft's flagship.

AN

Anthropic, OpenAI, Oracle Cloud Infrastructure

First confirmed Vera CPU customers; Anthropic frames Vera as 'a promising part of the ecosystem' for agentic workloads, and OCI plans to deploy hundreds of thousands of Vera CPUs starting 2026.

AP

Apple

The benchmark RTX Spark explicitly targets; Apple's M-series and upcoming M5 with GPU-core neural accelerators remain the on-device AI standard, and early single-core Geekbench numbers suggest Spark trails an M3 Max from 2023.

IN

Intel and Qualcomm

Incumbent Windows silicon vendors most directly disrupted; Huang declared the 40-year traditional PC era 'at an end,' a frame aimed squarely at Wintel and Snapdragon X.

Fact Check

11 cited
  1. [1] NVIDIA and Microsoft Reinvent Windows PCs for the Age of Agents
  2. [2] NVIDIA Recasts GB10 Superchip in Bid for High-End PC Market
  3. [3] NVIDIA Unveils RTX Spark Superchip at Computex 2026
  4. [4] NVIDIA Unveils Vera, the CPU for AI Agents
  5. [5] NVIDIA's N1X Apple Silicon Rival Is Two Years Behind
  6. [6] NVIDIA DGX Spark Now Available for $3,999: Real Impact Will Be at the AI Edge
  7. [7] NVIDIA DGX Spark Gets 18% Price Increase as Memory Shortages Bite
  8. [8] NVIDIA RTX Spark Laptop Prices Leaked
  9. [9] Five Thoughts on Jensen Huang's GTC Taipei 2026 Keynote
  10. [10] Microsoft Build: Unified Agentic AI Stack from Windows to Azure
  11. [11] NVIDIA Announces NemoClaw

Source Articles

Top 5

THE SIGNAL.

Analysts

"Frames RTX Spark plus Windows as the end of the click-and-type PC paradigm and the start of an agentic OS where users state intent and the machine executes: 'For forty years, you launched apps. Click. Type. With RTX Spark and Microsoft Windows, you ask - and the PC does the work.'"

Jensen Huang
CEO, NVIDIA

"Argues agentic workloads will be the dominant compute consumer, justifying a CPU class purpose-built for them rather than retrofitted from x86 server lineage: 'AI agents will be the largest users of computing. Vera is the first CPU designed for that future.'"

Jensen Huang
CEO, NVIDIA

"Positions the partnership as democratizing local AI compute, with on-device inference no longer rationed by cloud quotas: 'Our goal is to deliver unmetered intelligence to every home and every desk with Windows.'"

Satya Nadella
CEO, Microsoft

"Cautiously bullish on Vera as scaling compute for agentic workloads: 'Scaling compute is an important accelerant for the growth of models. We're excited to see Vera emerge as a promising part of the ecosystem when solving for agentic workloads.'"

James Bradbury
Head of Compute, Anthropic

"Treats Vera as a sustained-performance bet for agentic AI at hyperscale: 'OCI plans to deploy hundreds of thousands of NVIDIA Vera CPUs beginning in 2026 because agentic AI demands sustained performance at massive scale.'"

Karan Batta
Product Management Lead, Oracle Cloud Infrastructure

"Skeptical that first-generation Spark silicon meaningfully threatens Apple, noting early benchmarks show CPU performance trailing an M3 Max from late 2023: 'On the CPU front, it is trailing behind a chip from Apple that's more than two years old... It probably could. Eventually.'"

Malcolm Owen
Writer, AppleInsider

"Reframes DGX Spark away from the desktop-dev-box narrative; the unit economics only make sense once enterprises deploy the same silicon to physical AI at the edge: 'The real impact of DGX Spark will come from enterprise deployments at the edge... DGX Spark's impact is likely to be seen in physical AI.'"

Larry Dignan
Editor in Chief, Constellation Insights
The Crowd

"This is the NVIDIA RTX Spark Superchip. A new beginning for personal computers. Designed for creators, AI developers, and gamers, RTX Spark brings over 30 years of NVIDIA innovation to slim Windows laptops and small, ultra-efficient desktop PCs."

@@NVIDIARTXSpark4677

"At Computex, NVIDIA walked into a market it never owned: the PC itself. RTX Spark is an ARM superchip: 20-core Grace CPU, Blackwell GPU with 6,144 CUDA cores, up to 128GB unified memory, 1 PetaFLOP FP4. The GPU core count sits at RTX 5070 (!) level, in a laptop as thin as a..."

@@kimmonismus505

"NVIDIA has officially unveiled its RTX Spark platform for Arm-based PCs. It combines a 20-core Grace CPU, Blackwell RTX GPU, NPU, and up to 128GB of unified LPDDR5X memory into a single package designed for AI-focused laptops and mini PCs. Key specs: - 20-core Grace CPU..."

@@yabhishekhd553

"NVIDIA just announced the RTX Spark CPU, developed with Microsoft, at Computex."

@u/pedro197288
Broadcast
Nvidia's New Laptops - RTX Spark

Nvidia's New Laptops - RTX Spark

NVIDIA RTX Spark Reinvents Windows PCs for the Age of Personal AI

NVIDIA RTX Spark Reinvents Windows PCs for the Age of Personal AI

Early Preview of NVIDIA RTX Spark at Computex

Early Preview of NVIDIA RTX Spark at Computex