Microsoft evaluates different open models for Cowork


Source: testingcatalog.com · Published: 2026-06-18T14:40Z


Microsoft is evaluating a range of open and open-weight models, beyond DeepSeek, for its Copilot Cowork agentic work app, aiming to enable flexible model swapping based on task type, cost, and latency. The move could reduce dependence on single providers and offer enterprise customers more pricing and model choices, while creating internal competition for Microsoft's own MAI models.


TestingCatalog has learned that Microsoft teams behind Copilot Cowork are evaluating a broader set of open and open-weight models beyond DeepSeek as potential underlying options for the agentic work app. Axios recently reported that Microsoft is considering a Microsoft-hosted version of DeepSeek as a cheaper model option for Copilot Cowork, but the evaluation appears to extend beyond a single model family.

The key architectural detail is the separation between the model layer and the harness. Copilot Cowork is being built so the orchestration system can remain stable while the underlying models are swapped depending on task type, cost, latency, and required capability. That would allow Microsoft to route some work to frontier APIs, while other parts could be handled by cheaper self-hosted models on Azure. Over time, some lighter tasks may also become candidates for local execution as smaller models mature.
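The split between a stable harness and swappable models can be illustrated with a minimal routing sketch. Nothing here reflects Microsoft's actual implementation: the `ModelProfile` fields, the model names, the cost and latency figures, and the "cheapest eligible model" policy are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class ModelProfile:
    """Hypothetical description of one model the harness could call."""
    name: str
    hosting: str               # "frontier_api", "self_hosted", or "local"
    cost_per_1k_tokens: float  # illustrative figure, not real pricing
    typical_latency_ms: int    # illustrative figure
    capability: int            # 1 = light tasks, 3 = frontier-level tasks


@dataclass(frozen=True)
class Task:
    """Hypothetical description of a unit of agentic work."""
    kind: str
    required_capability: int
    max_latency_ms: int


def route(task: Task, catalog: Sequence[ModelProfile]) -> ModelProfile:
    """Pick the cheapest model that meets the task's capability and latency needs."""
    eligible = [
        m for m in catalog
        if m.capability >= task.required_capability
        and m.typical_latency_ms <= task.max_latency_ms
    ]
    if not eligible:
        raise LookupError(f"no model in the catalog can serve task {task.kind!r}")
    return min(eligible, key=lambda m: m.cost_per_1k_tokens)


# Made-up catalog covering the three hosting options the article mentions.
CATALOG = [
    ModelProfile("frontier-api-model", "frontier_api", 10.0, 2000, 3),
    ModelProfile("self-hosted-open-model", "self_hosted", 1.0, 1200, 2),
    ModelProfile("local-small-model", "local", 0.0, 300, 1),
]

if __name__ == "__main__":
    for task in (
        Task("summarize_email", required_capability=1, max_latency_ms=500),
        Task("draft_report", required_capability=2, max_latency_ms=3000),
        Task("multi_step_plan", required_capability=3, max_latency_ms=5000),
    ):
        print(task.kind, "->", route(task, CATALOG).name)
```

Because the harness only calls `route`, swapping a model in or out means editing the catalog rather than the orchestration logic, which is the property the article attributes to the design.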

This direction would make sense for Copilot Cowork, which is now generally available for Microsoft 365 Copilot customers and is aimed at enterprise users who expect reliability, governance, and data controls. For customers, the main benefit would be more flexible pricing and model choice. For Microsoft, it offers a way to reduce dependence on any single external model provider while controlling compute costs for long-running agentic workflows.

At the same time, this creates internal competition. Microsoft has been presenting its own MAI models as part of a broader push to become a top-tier AI lab alongside OpenAI, Anthropic, and Google. If Chinese and open-weight models continue to perform well enough to power parts of Copilot Cowork, Microsoft’s internal model teams will face a clearer benchmark: they will need to keep pace with fast-moving open-model providers.

According to sources familiar with the evaluation, these open-model developments have not reached production in Copilot Cowork yet. For now, they remain under active testing. The practical question is not whether Microsoft can plug in another model, but which model can meet enterprise expectations once cost, compliance, safety, and task quality are measured together.



