OpenAI Unveils Jalapeno LLM Inferencing Accelerator Built in Collaboration with Broadcom

wpnews.pro

cd /news/large-language-models/openai-unveils-jalapeno-llm-inferenc… · home › topics › large-language-models › article

[ARTICLE · art-38124] src=techpowerup.com ↗ pub=2026-06-24T17:06Z topic=large-language-models verified=true sentiment=↑ positive

OpenAI Unveils Jalapeno LLM Inferencing Accelerator Built in Collaboration with Broadcom

OpenAI unveiled Jalapeno, its in-house LLM inference accelerator chip developed with Broadcom, achieving tape-out in nine months. The chip is designed to accelerate inferencing for OpenAI's LLM stack powering ChatGPT, Codex, and future agentic AI products, with initial deployment targeted for late 2026.

read1 min views1 publishedJun 24, 2026

OpenAI today unveiled Japaneo, its in-house, purpose-built LLM inference accelerator chip, developed in collaboration with Broadcom. The chip has fixed-function and programmable compute hardware to accelerate inferencing of OpenAI LLM stack that powers ChatGPT, Codex, the OpenAI API, and the company's future agentic AI products. It is conceptually similar to Google's TPU, except designed for the OpenAI stack; and while TPU is used for both training and inferencing, Jalapeno appears to be limited to inferencing. The training is probably still done with GPUs.

The company said that in partnership with Broadcom, the chip was designed and reached manufacturing tape-out in just nine months, making it the fastest ASIC development cycle among advanced semiconductors. Jalapeno is part of a compute platform that will span multiple generations of these chips. The platform is being designed for initial deployment toward the end of 2026. There are no technical details about the chip itself, except that it's a contemporary multi-chip module with an interposer, a large and centrally-located logic tile, flanked by eight HBM3E stacks.

source & further reading

techpowerup.com — original article Games Made with Gen AI Suffer Up to 53% Worse Sales on Steam (PR) Transcend Launches New 218-Layer 3D NAND Storage Solutions for AI Computing and Edge Applications (PR) Biostar Introduces EdgeComp MS-NAT5000 Edge AI System

~/api · this article 200

$curl api.wpnews.pro/v1/news/openai-unveils-jalapeno-…

Read original on techpowerup.com → www.techpowerup.com/350253/openai-unveils-jalape…

mentioned entities

OpenAI

Broadcom

Jalapeno

ChatGPT

Codex

Google

TPU

metadata

slugopenai-unveils-jalapeno-llm-inferencing-accelerator-built-in-collaboration-with

topic#large-language-models

secondary4 topics

sentimentpositive

canonicaltechpowerup.com

navigation

← prevI’ve built and tested hundreds o…

next →Snowflake CEO finds GLM-5.2 comp…

── more in #large-language-models 4 stories · sorted by recency

devclubhouse.com · 24 Jun · #large-language-models

How OpenAI's Jalapeño Chip Changes Production LLM Serving

cautiousoptimism.news · 24 Jun · #large-language-models

Why OpenAI’s new chip will help it go faster

cryptobriefing.com · 24 Jun · #large-language-models

Broadcom targets over $100B in AI semiconductor revenue by 2028

letsdatascience.com · 24 Jun · #large-language-models

OpenAI and Broadcom Unveil Jalape1o Inference Processor

── more on @openai 3 stories trending now

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 22 Jun · #large-language-models

MCP vs Skills: Why Skills Save Context Tokens

wpnews · 22 Jun · #artificial-intelligence

Value for Money Is All You Need

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required