OpenAI and Broadcom unveil "Jalapeño," a custom chip built for LLM inference

wpnews.pro

cd /news/large-language-models/openai-and-broadcom-unveil-jalapeno-… · home › topics › large-language-models › article

[ARTICLE · art-37822] src=the-decoder.com ↗ pub=2026-06-24T13:50Z topic=large-language-models verified=true sentiment=· neutral

OpenAI and Broadcom unveil "Jalapeño," a custom chip built for LLM inference

OpenAI and Broadcom unveiled 'Jalapeño,' a custom chip designed for large language model inference, claiming better performance per watt. Development took nine months with OpenAI's models aiding the process. Large-scale deployment is planned for late 2026, with Microsoft expected to buy 40 percent of the chips.

read3 min views1 publishedJun 24, 2026

OpenAI and Broadcom unveil "Jalapeño," a custom chip built for LLM inference — Image: The Decoder

Key Points #

OpenAI and Broadcom have unveiled "Jalapeño," a custom chip built specifically for large language model inference.
OpenAI says the architecture delivers better performance per watt. Development took just nine months, with OpenAI's own models helping speed up the process.
Custom hardware is meant to make running AI models cheaper and more reliable. Large-scale deployment is planned for late 2026, with Microsoft expected to buy 40 percent of the chips.

OpenAI is adding custom hardware to its tech stack. The "Jalapeño" chip, developed with Broadcom, is tailored for large language model inference and is set to run at scale by late 2026.

According to a joint announcement, OpenAI and Broadcom have unveiled "Jalapeño" - OpenAI's first so-called "Intelligence Processor." It's a custom accelerator built specifically for large language model inference, and the first chip in a multi-generation platform the two companies are building together.

Broadcom CEO Hock Tan and President Charlie Kawwas handed the first wafer to OpenAI CEO Sam Altman and President Greg Brockman. For OpenAI, this marks its first step into custom hardware after years of focusing on models and products.

OpenAI says Jalapeño isn't a modified general-purpose chip. It was designed from scratch for modern LLM inference. OpenAI handles the chip design, Broadcom contributes silicon manufacturing and networking technology including its Tomahawk networking chips, and Celestica takes care of boards, racks, and system integration.

Performance claims lack independent verification #

Early tests showed performance per watt that's "substantially better" than current state-of-the-art hardware, according to OpenAI. These are self-reported numbers that haven't been finalized. Take them with a grain of salt. A technical report is supposed to follow. Right now, it's unclear which chips Jalapeño was tested against, on what tasks, and under what conditions.

The architecture reportedly cuts data movement and pushes utilization closer to its theoretical max. Engineering samples are already running ML workloads in the lab, including the GPT-5.3-Codex-Spark model. That model currently runs on Cerebras hardware, which also specializes in inference.

OpenAI says the process from design to tape-out took just nine months, what the company calls the fastest ASIC development cycle for high-performance semiconductors it's aware of. OpenAI's own models helped speed up parts of the design process. The rumors about chip plans, though, have been circulating since 2023.

The announcement reflects OpenAI's argument that controlling the full stack from chip to product lets it run models faster, more reliably, and at lower cost. Broadcom CEO Tan says the first deployment is planned for late 2026 at gigawatt scale, together with Microsoft and other partners. Broadcom has reportedly demanded that Microsoft guarantee it will buy 40 percent of the chips to secure the first phase.

AI News Without the Hype – Curated by Humans

					Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.				

					Subscribe now

OpenAI

source & further reading

the-decoder.com — original article OpenAI's deployment chief on Codex growth, falling AI prices, and the ROI question Vibecoding is becoming a deal-breaker test for software acquisitions Pangram CEO says language models give themselves away by making the same arguments

~/api · this article 200

$curl api.wpnews.pro/v1/news/openai-and-broadcom-unve…

Read original on the-decoder.com → the-decoder.com/openai-and-broadcom-unveil-jalap…

mentioned entities

OpenAI

Broadcom

Jalapeño

Microsoft

Sam Altman

Greg Brockman

Hock Tan

Charlie Kawwas

metadata

slugopenai-and-broadcom-unveil-jalapeno-a-custom-chip-built-for-llm-inference

topic#large-language-models

secondary3 topics

sentimentneutral

canonicalthe-decoder.com

navigation

← prevOpenAI tests first homegrown AI …

next →Chinese supercomputer displaces …

── more in #large-language-models 4 stories · sorted by recency

cryptobriefing.com · 24 Jun · #large-language-models

OpenAI and Broadcom unveil LLM-optimized intelligence processor in 10-gigawatt chip partnership

cryptobriefing.com · 24 Jun · #large-language-models

OpenAI tests first homegrown AI chip Jalapeño for customer queries

startupfortune.com · 24 Jun · #large-language-models

OpenAI and Broadcom unveil Jalapeño, a custom inference chip that puts Nvidia's pricing power on notice

dev.to · 24 Jun · #large-language-models

Why I Run AI Locally Instead of Using ChatGPT for Client Work

── more on @openai 3 stories trending now

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 22 Jun · #large-language-models

MCP vs Skills: Why Skills Save Context Tokens

wpnews · 22 Jun · #ai-agents

Anthropic's engineering leader says Claude Code is making programmers lonelier

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required