cd /news/large-language-models/exclusive-microsoft-evaluates-differ… · home topics large-language-models article
[ARTICLE · art-32758] src=testingcatalog.com ↗ pub= topic=large-language-models verified=true sentiment=· neutral

Microsoft evaluates different open models for Cowork

Microsoft is evaluating a range of open and open-weight models, beyond DeepSeek, for its Copilot Cowork agentic work app, aiming to enable flexible model swapping based on task type, cost, and latency. The move could reduce dependence on single providers and offer enterprise customers more pricing and model choices, while creating internal competition for Microsoft's own MAI models.

read2 min views1 publishedJun 18, 2026

TestingCatalog has learned that Microsoft teams behind Copilot Cowork are evaluating a broader set of open and open-weight models beyond DeepSeek as potential underlying options for the agentic work app. Axios recently reported that Microsoft is considering a Microsoft-hosted version of DeepSeek as a cheaper model option for Copilot Cowork, but the evaluation appears to extend beyond a single model family.

The key architectural detail is the separation between the model layer and the harness. Copilot Cowork is being built so the orchestration system can remain stable while the underlying models are swapped depending on task type, cost, latency, and required capability. That would allow Microsoft to route some work to frontier APIs, while other parts could be handled by cheaper self-hosted models on Azure. Over time, some lighter tasks may also become candidates for local execution as smaller models mature.

This direction would make sense for Copilot Cowork, which is now generally available for Microsoft 365 Copilot customers and is aimed at enterprise users who expect reliability, governance, and data controls. For customers, the main benefit would be more flexible pricing and model choice. For Microsoft, it offers a way to reduce dependence on any single external model provider while controlling compute costs for long-running agentic workflows.

At the same time, this creates an internal competition. Microsoft has been presenting its own MAI models as part of a broader push to become a top-tier AI lab next to OpenAI, Anthropic, and Google. If Chinese and open-weight models continue to perform strongly enough to power parts of Copilot Cowork, Microsoft’s internal model teams will face a clearer benchmark: they need to compete with fast-moving open-model providers.

According to sources familiar with the evaluation, these open-model developments have not reached production in Copilot Cowork yet. For now, they remain under active testing. The practical question is not whether Microsoft can plug in another model, but which model can meet enterprise expectations once cost, compliance, safety, and task quality are measured together.

── more in #large-language-models 4 stories · sorted by recency
── more on @microsoft 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/exclusive-microsoft-…] indexed:0 read:2min 2026-06-18 ·