cd /news/ai-infrastructure/turiyam-builds-cheaper-ai-inference-… · home topics ai-infrastructure article
[ARTICLE · art-37144] src=letsdatascience.com ↗ pub= topic=ai-infrastructure verified=true sentiment=· neutral

Turiyam builds cheaper AI inference infrastructure for enterprises

Bengaluru startup Turiyam, founded in December 2024, is building a full-stack AI inference platform that bypasses Nvidia hardware and charges customers by finished outputs rather than tokens. The company raised $4 million in a pre-seed round led by Ankur Capital and Axilor's Micelio Fund, and is currently piloting with select enterprises to exploit the paradox of falling token prices and rising enterprise AI budgets.

read3 min views5 publishedJun 24, 2026
Turiyam builds cheaper AI inference infrastructure for enterprises
Image: Letsdatascience (auto-discovered)

Bengaluru startup Turiyam, founded in December 2024 by Sanchayan Sinha, Parag Jain, and Praveen Jain, is building a full-stack AI inference platform that bypasses Nvidia hardware and bills customers by finished outputs rather than by tokens. The Ken reports token prices have fallen roughly 35% over two years while enterprise AI budgets have risen nearly 6x to about $7 million in 2026, a cost paradox Turiyam aims to exploit. Inc42 reported in March 2026 that the company raised $4 million in a pre-seed round led by Ankur Capital and Axilor's Micelio Fund, and is currently piloting with select enterprises. Its architecture pairs custom inference-focused silicon with a compiler-led software stack, targeting performance-per-watt and total cost of ownership rather than raw training throughput.

What Turiyam is building

The Ken reports that Turiyam, a Bengaluru deeptech startup, was founded in December 2024 by Sanchayan Sinha, Parag Jain, and Praveen Jain to build a full-stack AI inference solution. Per The Ken, the company designed a chip specifically for inference rather than training, and the stack omits Nvidia hardware. The Ken also reports Turiyam charges for finished outputs rather than by tokens - a pricing model that shifts cost risk from input compute to delivered results.

Funding and backers

Inc42 reported in March 2026 that Turiyam.ai raised $4 million in a pre-seed round led by Ankur Capital and Axilor's Micelio Fund. The company is using the capital to accelerate product development, expand its team, and support early enterprise and data-centre deployments. Ritu Verma, managing partner of Ankur Capital, said: "Putting the software stack in place from day one, rather than as an afterthought, is what makes the approach differentiated and relevant for where the market is headed."

Market context

The Ken reports token prices have declined about 35% over two years while enterprise AI budgets have climbed nearly 6x to about $7 million in 2026. That combination - cheaper tokens but far higher total spend - is the gap Turiyam is targeting. A former CTO quoted by The Ken said: "The cost of a single query has collapsed, but our total bill has exploded."

Technical approach

Turiyam's architecture pairs a hybrid memory design with a compiler-led optimization layer aimed at maximizing throughput for inference-heavy workloads while improving performance-per-watt and lowering total cost of ownership. Industry context: specialized inference accelerators from vendors such as Groq and Google have pursued similar tradeoffs by reducing memory and training-centric features in favor of smaller die area and lower energy per token. The startup is currently in pilot deployments with select enterprises.

What to watch

Adoption will hinge on published benchmarks showing total-cost-of-inference advantages at scale, interoperability with dominant model formats and runtimes, and traction with customers already running high-volume inference spend. Competing vendors typically need third-party benchmarks and real-world case studies to win enterprise procurement cycles, especially when incumbent GPU ecosystems already host existing workflows.

Scoring Rationale #

Solid niche story on an early-stage Indian inference-silicon startup that has verified $4M pre-seed backing and active enterprise pilots. The output-based pricing model and Nvidia-free stack are a notable differentiator, but the company is pre-commercial and the story is regionally focused. Modest pull from 6.6 - sits in solid/niche territory rather than 'notable' until third-party benchmarks or larger customer announcements emerge.

Practice interview problems based on real data

1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

── more in #ai-infrastructure 4 stories · sorted by recency
── more on @turiyam 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/turiyam-builds-cheap…] indexed:0 read:3min 2026-06-24 ·