OpenAI unveils Jalapeño, its first custom AI chip with Broadcom
TECH

OpenAI unveils Jalapeño, its first custom AI chip with Broadcom

54+
Signals

Strategic Overview

  • 01.
    OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom AI accelerator - an 'Intelligence Processor' architected from scratch for LLM inference and pitched as the first chip in a multi-generation compute platform.
  • 02.
    The chip went from initial design to manufacturing tape-out in roughly nine months, which OpenAI calls the fastest ASIC development cycle for a high-performance semiconductor it is aware of, with engineering samples already running ML workloads in the lab - including the GPT-5.3-Codex-Spark model.
  • 03.
    OpenAI says it used its own models to accelerate parts of the chip design, targets substantially better performance-per-watt than current state-of-the-art accelerators, and plans large-scale deployment at gigawatt scale starting late 2026 as part of a full-stack infrastructure strategy.

Deep Analysis

Nine months to tape-out, with the models helping design the chip

The detail that traveled fastest is the timeline - Jalapeño went from initial design to manufacturing tape-out in about nine months, which OpenAI describes as the fastest ASIC development cycle for a high-performance semiconductor it is aware of [1][2]. High-performance accelerators normally take far longer to reach tape-out, so compressing the cycle this hard is the unusual part of the story, and engineering samples are already running ML workloads in the lab at production target frequency and power, including the GPT-5.3-Codex-Spark model [1][3]. The reason given is reflexive - OpenAI says it used its own models to accelerate parts of the design and optimization process, and Greg Brockman said the degree of acceleration was surprising [1]. That framing dominated the social reaction too, where the nine-month build and the 'designed from scratch' claim were the most-shared specifics and the dominant narrative was OpenAI shifting from chip customer to chip designer. The takeaway - if AI-assisted design genuinely compressed a multi-year cadence, the moat is less the chip than the loop that builds the next one faster.

The economics: perf-per-watt and a claimed ~50% cost cut for inference

Jalapeño is an inference chip first, and the pitch is cost. Broadcom CEO Hock Tan said early testing showed roughly 50% cost savings compared to typical AI GPUs, and the chip is reportedly fabricated by TSMC [4]. OpenAI claims significantly better performance-per-watt than current state-of-the-art accelerators, with detailed benchmarks still to come [1][3]. The stated design goal is to combine the throughput of leading accelerators with latency closer to the fastest specialized inference systems, and to stay flexible enough to run all LLMs rather than one model family [2]. The reason this matters is unit economics - inference, not training, is where the recurring bill lives, so cheaper tokens per watt directly lower the cost of running ChatGPT, Codex and the API at scale and chip away at Nvidia's pricing power even when positioned as complementary capacity [5]. The honest caveat - these are self-reported numbers and the detailed benchmarks have not landed yet [1].

The competitive chessboard: shot across Nvidia's bow, Broadcom the quiet winner

The competitive chessboard: shot across Nvidia's bow, Broadcom the quiet winner
Broadcom (AVGO) rose ~2% on the Jalapeño announcement while Nvidia (NVDA) slipped just ~0.26%, signaling the market read it as a Broadcom win rather than an Nvidia threat.

The headline reads as an Nvidia story, but the cleaner read is a Broadcom one. Constellation Research analyst Holger Mueller called Jalapeño 'a shot in front of the bow for Nvidia' [2], and the launch was widely framed as putting Nvidia's pricing power on notice [5]. Yet OpenAI is careful to call the chip a complement to Nvidia rather than a replacement, and the market reaction was telling - AVGO rose roughly 2% while NVDA slipped only about 0.26% on the day [5]. Broadcom's role is to enable customers to design their own silicon, which makes it the structural beneficiary of every hyperscaler that goes custom rather than a single-deal winner [6]. The takeaway - watch who supplies the picks and shovels - the custom-silicon wave can pressure Nvidia margins while Broadcom captures the enablement layer regardless of which lab wins.

The deployment reality check: gigawatt-scale by late 2026, but on a roadmap and on faith

The ambition is concrete and large. Initial large-scale deployment is planned for late 2026 at gigawatt scale with data-center partners, expanding over multiple generations [1][3], and it sits inside the broader October 2025 Broadcom collaboration to deploy 10 gigawatts of OpenAI-designed accelerators from the second half of 2026 through 2029 [7][8]. Demand signals are already attached - Microsoft is reportedly expected to buy about 40% of the chips in the first phase [1]. The skeptic's note is that almost everything load-bearing is still forward-looking - performance-per-watt and the ~50% cost figure are self-reported with no published benchmarks [4], and the timeline runs through partners like Celestica for system integration and TSMC for fabrication [3]. The takeaway - the strategy is credible and the partners are real, but until third-party benchmarks and shipping racks arrive, treat the efficiency claims as a target rather than a result.

Historical Context

2025-09
Reports surfaced that OpenAI was working with Broadcom on an 'XPU' accelerator chip earmarked for 2026 production.
2025-10-13
OpenAI and Broadcom announced a strategic collaboration to develop and deploy 10 gigawatts of OpenAI-designed AI accelerators with Broadcom networking, targeting deployment from the second half of 2026 through end of 2029.
2026-06-24
OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom AI inference chip, after roughly 18 months of joint work and a nine-month design cycle.

Power Map

Key Players
Subject

OpenAI unveils Jalapeño, its first custom AI chip with Broadcom

OP

OpenAI

Designed the accelerator from a blank slate around its understanding of LLM workloads, models, kernels and serving systems, pursuing full-stack vertical integration and using its own models to speed up the design.

BR

Broadcom (AVGO)

Provides core silicon implementation, manufacturing support and networking technology including Tomahawk silicon - positioned as the biggest commercial beneficiary as a custom-chip enabler for hyperscalers. Stock rose roughly 2% on the news.

CE

Celestica

Industrializes the platform via board, rack and full-system integration and scalable production.

TS

TSMC

Reported foundry and manufacturer of the Jalapeño chip.

MI

Microsoft

Reportedly expected to purchase about 40% of the chips in the first phase.

NV

Nvidia (NVDA)

Incumbent AI-accelerator leader framed as the target of competitive pressure, though OpenAI says Jalapeño complements rather than replaces Nvidia. Stock slipped roughly 0.26% on the news.

Fact Check

8 cited
  1. [1] OpenAI and Broadcom unveil Jalapeño, a custom chip built for LLM inference
  2. [2] OpenAI, Broadcom unveil first AI inference chip
  3. [3] OpenAI unveils Jalapeño AI inference chip with Broadcom
  4. [4] OpenAI and Broadcom reveal Jalapeño, first AI chip in partnership
  5. [5] Broadcom (AVGO) and OpenAI new custom AI chip Jalapeño takes aim at Nvidia
  6. [6] Why OpenAI's Jalapeño is really a Broadcom story
  7. [7] OpenAI and Broadcom announce strategic collaboration
  8. [8] OpenAI, Broadcom debut LLM-optimized chip

Source Articles

Top 5

THE SIGNAL.

Analysts

"Positions Jalapeño as part of OpenAI's long-term full-stack infrastructure strategy and an addition to - not a replacement for - Nvidia, and says he was surprised at how much OpenAI's own models accelerated the design."

Greg Brockman
President and Co-Founder, OpenAI

"Says early testing indicates the chip will run OpenAI's most important workloads close to the hardware's theoretical limits."

Richard Ho
Hardware lead, OpenAI

"Frames the OpenAI collaboration as a commitment to scaling physical AI infrastructure and cites roughly 50% cost savings versus typical GPUs in early testing."

Hock Tan
President & CEO, Broadcom

"Frames Jalapeño as a competitive warning to Nvidia, reading the launch as pressure on the incumbent accelerator leader."

Holger Mueller
Analyst, Constellation Research
The Crowd

"We've designed and built our first AI chip: Jalapeño. Designed from the ground up by OpenAI and brought to production with @Broadcom, Jalapeño is purpose-built for the LLM workloads powering ChatGPT, Codex, the API, and future agentic products. Chips are foundational to the AI"

@@OpenAI21044

"OpenAI just unveiled Jalapeño, its first custom AI chip designed from scratch for LLM inference- It is OpenAI moving deeper into the full stack: chips, kernels, memory, networking, racks, scheduling, deployment and product experience. OpenAI has learned from Cerebras-deal what"

@@kimmonismus558

"OpenAI built their own chip - the Jalapeño - designed for inference. They did it in just nine months. Quoting from the blog: 'OpenAI designed the chip from scratch around its deep understanding of LLM fundamentals, informed by its roadmap of models, kernels, serving systems,"

@@AndrewCurran_461
Broadcast
OpenAI President Greg Brockman on new chip: This is a real performance improvement

OpenAI President Greg Brockman on new chip: This is a real performance improvement

OpenAI Unveils First Custom AI Chip With Broadcom | Bloomberg Tech 6/24/2026

OpenAI Unveils First Custom AI Chip With Broadcom | Bloomberg Tech 6/24/2026

OpenAI’s Jalapeño Chip Could Change LLMS Forever

OpenAI’s Jalapeño Chip Could Change LLMS Forever