Automatic LLM routing that optimizes cost and speed


Source: factory.ai · Published: 2026-06-17T19:42Z · Topic: large-language-models


Factory AI launched Factory Router, an automatic model selection system that routes each task to the optimal large language model, cutting costs by up to 25% while maintaining frontier performance. The system provides 99.9%+ request reliability by routing across models and providers, and is available in private research preview for enterprise customers.


Automatic model selection for every Droid session. Factory Router picks the right model for each task, maintains frontier performance, and cuts cost by up to 25%.

Enterprise AI costs are climbing, and a bigger token bill does not mean more work is getting done. To avoid sacrificing performance, engineers usually default to the most capable model for every task. Simple questions, mechanical refactors, documentation updates, small bug fixes, and search-heavy investigations end up on the same premium path as work that truly needs frontier performance. Budgets are exhausted without a clear increase in organization-level output.

Today you pick a model per task and lean on the most expensive one to be safe. With Factory Router, you select it once and it picks the best model for each session.
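The article does not describe how Factory Router decides, so the following is only a minimal sketch of the general idea of routing work by task type. The tier names, relative costs, and the split of task kinds between tiers are all hypothetical; the task kinds themselves come from the list of work the article says rarely needs the premium path.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    """Hypothetical model tier; names and costs are illustrative only."""
    name: str
    relative_cost: float  # cost per session relative to the premium tier


PREMIUM = ModelTier("premium-frontier", 1.0)
STANDARD = ModelTier("standard", 0.4)
LIGHT = ModelTier("lightweight", 0.1)

# Assumed grouping of the task kinds named in the article.
LIGHT_TASKS = {"simple_question", "documentation_update"}
STANDARD_TASKS = {"mechanical_refactor", "small_bug_fix", "search_investigation"}


def route(task_kind: str) -> ModelTier:
    """Pick a tier for a task; unknown kinds fall back to the premium tier."""
    if task_kind in LIGHT_TASKS:
        return LIGHT
    if task_kind in STANDARD_TASKS:
        return STANDARD
    return PREMIUM
```

Falling back to the premium tier for unrecognized work mirrors the "to be safe" default described above, so only work that is clearly routine is moved off the expensive path.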

Compared with Claude Opus 4.7, Factory Router maintains frontier performance at lower cost per session. At enterprise scale, those savings apply across every Droid session, with spend tied to the work being done rather than a blanket default to the most expensive model.
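To make the cost claim concrete, here is a small arithmetic sketch of how blended savings arise when only part of the workload moves to a cheaper model. The function name and the input numbers are assumptions chosen for illustration; they are not figures published by Factory.

```python
def blended_savings(cheap_share: float, cheap_relative_cost: float) -> float:
    """Fraction of spend saved versus sending every session to the premium model.

    cheap_share: fraction of sessions routed to the cheaper model (0 to 1).
    cheap_relative_cost: cheaper model's cost per session relative to premium (0 to 1).
    """
    if not 0.0 <= cheap_share <= 1.0:
        raise ValueError("cheap_share must be between 0 and 1")
    if not 0.0 <= cheap_relative_cost <= 1.0:
        raise ValueError("cheap_relative_cost must be between 0 and 1")
    # Baseline cost is 1.0 per session; blended cost is
    # (1 - cheap_share) * 1.0 + cheap_share * cheap_relative_cost.
    return cheap_share * (1.0 - cheap_relative_cost)
```

Under these made-up inputs, routing half of all sessions to a model that costs half as much yields `blended_savings(0.5, 0.5) == 0.25`, which shows how a figure on the order of 25% can come from moving only routine work.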

When a provider degrades, rate limits hit, or capacity gets constrained, your sessions keep going. Factory Router routes across models, providers, and capacity to deliver 99.9%+ request reliability.

If a provider path degrades, Factory Router keeps the session running on the same model through a healthy provider. Enterprise customers get reserved throughput for critical work instead of relying only on shared public capacity.
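Keeping a session on the same model through a healthy provider can be pictured as an ordered failover loop. This is a generic sketch, not Factory's implementation: `ProviderError`, `call_with_failover`, and the provider callables are hypothetical stand-ins for whatever client each provider path would use.

```python
from typing import Callable, Optional, Sequence


class ProviderError(Exception):
    """Hypothetical error for a degraded, rate-limited, or saturated provider."""


def call_with_failover(
    prompt: str, providers: Sequence[Callable[[str], str]]
) -> str:
    """Try each provider path for the same model in order; return the first success."""
    last_error: Optional[ProviderError] = None
    for provider in providers:
        try:
            return provider(prompt)
        except ProviderError as err:
            last_error = err  # remember the failure and try the next path
    raise ProviderError("all provider paths failed") from last_error
```

A caller would pass the provider paths in order of preference, for example reserved capacity first and shared public capacity second.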

Factory Router keeps frontier models available as they come online, so high-complexity work gets the strongest model class.

Route eligible work to US-hosted open-source models when you need cost-efficient or controlled options.

Routing guidance brings your team's context into Factory Router, so automatic model selection reflects how work actually happens inside your organization. The same policy surfaces that govern other Factory models apply here, so admins manage access, compliance, and eligibility without a separate control plane.
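One way to think about admin-managed eligibility is as a filter applied before any model is chosen. The sketch below assumes a simple policy with an allowed hosting region and an optional open-source requirement; the `Model` and `Policy` types and their fields are hypothetical and do not reflect Factory's actual policy schema.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Sequence


@dataclass(frozen=True)
class Model:
    name: str
    hosted_region: str
    open_source: bool
    relative_cost: float


@dataclass(frozen=True)
class Policy:
    """Hypothetical org policy: where models may be hosted and whether they must be open source."""
    allowed_regions: FrozenSet[str]
    require_open_source: bool = False


def eligible(models: Sequence[Model], policy: Policy) -> List[Model]:
    """Keep only the models that the policy permits."""
    return [
        m for m in models
        if m.hosted_region in policy.allowed_regions
        and (m.open_source or not policy.require_open_source)
    ]


def cheapest_eligible(models: Sequence[Model], policy: Policy) -> Model:
    """Pick the lowest-cost model among those the policy permits."""
    candidates = eligible(models, policy)
    if not candidates:
        raise LookupError("no model satisfies the policy")
    return min(candidates, key=lambda m: m.relative_cost)
```

Applying the policy filter first means the cost-based choice can never select a model that admins have ruled out.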

Factory Router is in private research preview in the Factory CLI and Desktop App. Once enabled for your org, it appears in the model picker for every user with no setup required. Mission workers can use it too, so long-running autonomous work gets the same automatic model selection and savings as interactive and headless sessions.


Read original on factory.ai → factory.ai/product/router
