cd /news/large-language-models/automatic-llm-routing-that-optimizes… · home topics large-language-models article
[ARTICLE · art-31654] src=factory.ai ↗ pub= topic=large-language-models verified=true sentiment=↑ positive

Automatic LLM routing that optimizes cost and speed

Factory AI launched Factory Router, an automatic model selection system that routes each task to the optimal large language model, cutting costs by up to 25% while maintaining frontier performance. The system provides 99.9%+ request reliability by routing across models and providers, and is available in private research preview for enterprise customers.

read2 min views1 publishedJun 17, 2026

Automatic model selection for every Droid session. Factory Router picks the right model for each task, maintains frontier performance, and cuts cost by up to 25%.

Automatic model selection for every Droid session. Factory Router picks the right model for each task, maintains frontier performance, and cuts cost by up to 25%.

Enterprise AI costs are climbing, and a bigger token bill does not mean more work is getting done. To avoid losing on performance, engineers usually default to the most performant model for all tasks. Simple questions, mechanical refactors, documentation updates, small bug fixes, and search-heavy investigations end up on the same premium path as work that truly needs frontier performance. Budgets get exhausted without a clear increase in organization-level output.

Today you pick a model per task and lean on the most expensive one to be safe. With Factory Router you choose once and it picks the best model for each session.

Compared with Claude Opus 4.7, Factory Router maintains frontier performance at lower cost per session. At enterprise scale, those savings apply across every Droid session, with spend tied to the work being done rather than a blanket default to the most expensive model.

When a provider degrades, rate limits hit, or capacity gets constrained, your sessions keep going. Factory Router routes across models, providers, and capacity to deliver 99.9%+ request reliability.

If a provider path degrades, Factory Router keeps the session running on the same model through a healthy provider. Enterprise customers get reserved throughput for critical work instead of relying only on shared public capacity.

Factory Router keeps frontier models available as they come online, so high-complexity work gets the strongest model class.

Route eligible work to US-hosted open-source models when you need cost-efficient or controlled options.

Routing guidance brings your team's context into Factory Router, so automatic model selection reflects how work actually happens inside your organization. The same policy surfaces that govern other Factory models apply here, so admins manage access, compliance, and eligibility without a separate control plane.

Factory Router is in private research preview in the Factory CLI and Desktop App. Once enabled for your org, it appears in the model picker for every user with no setup required. Mission workers can use it too, so long-running autonomous work gets the same automatic model selection and savings as interactive and headless sessions.

start building

Start building

── more in #large-language-models 4 stories · sorted by recency
── more on @factory ai 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/automatic-llm-routin…] indexed:0 read:2min 2026-06-17 ·