Microsoft Introduces MDASH for Large-Scale AI Vulnerability Research


Source: infoq.com · Published 2026-05-25T16:30Z · Topic: ai-agents


Microsoft has introduced MDASH, a multi-model agentic security platform that uses more than 100 specialized AI agents to automate large-scale vulnerability discovery across Windows, Hyper-V, and Azure codebases. The company reports an 88.45% score on the CyberGym benchmark, roughly five points ahead of the next-highest entry, and 100% recall on historical tcpip.sys vulnerabilities. The release signals a shift toward orchestrated AI security systems in which coordinated agents and validation frameworks matter more than individual model capability.


Microsoft has introduced a new AI-driven vulnerability discovery system called MDASH, a multi-model agentic security platform designed to automate large-scale code auditing across Windows and other Microsoft software environments. The system combines more than 100 specialized AI agents that work together to scan, validate, debate, and prove vulnerabilities across complex codebases.

The announcement indicates a transition in AI-assisted cybersecurity from individual model testing to more integrated systems that focus on coordinated agents, validation processes, and automated proof generation. Microsoft emphasizes that the overall framework surrounding the models is more significant than any single model, particularly for extensive proprietary codebases like Windows, Hyper-V, and Azure.

According to Microsoft, MDASH achieved an 88.45% score on the public CyberGym benchmark of 1,507 real-world vulnerabilities, outperforming the next-highest entry by roughly five points. Internally, the company reports 96% recall on historical clfs.sys vulnerabilities reviewed by the Microsoft Security Response Center, and 100% recall on historical tcpip.sys cases.

Source: Microsoft Blog
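To make the reported figures concrete, the short Python sketch below shows how a recall value is computed and what an 88.45% score would imply in absolute terms. It assumes the CyberGym score is a plain success rate over all 1,507 tasks, which the article does not state, and the 24-of-25 case count is a hypothetical chosen only to illustrate a 96% recall; Microsoft's actual case counts are not given.

```python
def recall(found: int, total: int) -> float:
    """Fraction of known vulnerabilities that were rediscovered."""
    if total <= 0:
        raise ValueError("total must be positive")
    return found / total


CYBERGYM_TASKS = 1507    # benchmark size reported in the article
REPORTED_SCORE = 0.8845  # score reported in the article

# Assumption: the score is a simple success rate over all tasks.
implied_successes = round(CYBERGYM_TASKS * REPORTED_SCORE)
print(implied_successes)                                    # 1333
print(f"{recall(implied_successes, CYBERGYM_TASKS):.2%}")   # 88.45%

# Hypothetical counts, used only to show what 96% recall looks like.
print(f"{recall(24, 25):.0%}")                              # 96%
```

Under that assumption, the score corresponds to roughly 1,333 of the 1,507 benchmark tasks.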

Rather than relying on a single model or prompt chain, MDASH operates as a multi-stage pipeline. Specialized agents handle scanning, debate, validation, deduplication, and exploitation separately. Microsoft says this architecture helps the system reason across multiple files, identify lifecycle and concurrency bugs, and validate whether a vulnerability is practically exploitable instead of merely theoretical.
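The staged design described above can be pictured with a minimal Python sketch. This is not Microsoft's implementation: the stage functions, the `Finding` type, the file names, and the toy rules inside each stage are all hypothetical stand-ins for what would be LLM-driven agents in a real system. The sketch only shows the shape of the idea, with each stage taking a list of findings and handing a refined list to the next.

```python
from dataclasses import dataclass, replace
from typing import Callable


@dataclass(frozen=True)
class Finding:
    """A candidate vulnerability (hypothetical structure)."""
    file: str
    line: int
    kind: str
    validated: bool = False
    exploit_proven: bool = False


Stage = Callable[[list[Finding]], list[Finding]]

# Toy rules standing in for agent judgment (assumptions, not real logic).
REJECTED_IN_DEBATE = {"null dereference"}
PROVABLE = {"use-after-free"}


def scan(findings: list[Finding]) -> list[Finding]:
    # Stand-in for scanner agents reporting candidates, duplicates included.
    return findings + [
        Finding("net/parse.c", 120, "out-of-bounds read"),
        Finding("net/parse.c", 120, "out-of-bounds read"),
        Finding("fs/log.c", 48, "use-after-free"),
        Finding("fs/log.c", 300, "null dereference"),
    ]


def debate(findings: list[Finding]) -> list[Finding]:
    # Stand-in for agents arguing a finding away.
    return [f for f in findings if f.kind not in REJECTED_IN_DEBATE]


def validate(findings: list[Finding]) -> list[Finding]:
    # Stand-in for a validation agent confirming each survivor.
    return [replace(f, validated=True) for f in findings]


def deduplicate(findings: list[Finding]) -> list[Finding]:
    # Keep the first report for each (file, line, kind) key.
    seen: set[tuple[str, int, str]] = set()
    unique: list[Finding] = []
    for f in findings:
        key = (f.file, f.line, f.kind)
        if key not in seen:
            seen.add(key)
            unique.append(f)
    return unique


def prove(findings: list[Finding]) -> list[Finding]:
    # Stand-in for an exploitation agent producing a proof of concept.
    return [replace(f, exploit_proven=f.kind in PROVABLE) for f in findings]


def run_pipeline(stages: list[Stage]) -> list[Finding]:
    current: list[Finding] = []
    for stage in stages:
        current = stage(current)
    return current


PIPELINE: list[Stage] = [scan, debate, validate, deduplicate, prove]
results = run_pipeline(PIPELINE)
for finding in results:
    print(finding)
```

Keeping each stage behind the same list-in, list-out signature means a stage can be replaced or reordered without touching the others, which is one plausible reading of why the article stresses the pipeline over any single prompt chain.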

A major part of the announcement focused on the idea that future AI security tooling will depend less on raw model capability and more on orchestration systems built around models. Microsoft described MDASH as model-agnostic by design, allowing teams to swap or upgrade models while keeping the surrounding validation, proving, and workflow infrastructure intact.
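One common way to get this kind of model-agnostic behavior is to have the workflow depend on a small interface rather than on a specific model. The Python sketch below illustrates that pattern under stated assumptions: `ModelBackend`, `Auditor`, and both toy backends are hypothetical names invented for illustration, and nothing here reflects MDASH's actual interfaces.

```python
from typing import Protocol


class ModelBackend(Protocol):
    """Hypothetical interface the surrounding workflow depends on."""

    def complete(self, prompt: str) -> str: ...


class KeywordModel:
    """Toy backend: flags prompts that mention strcpy."""

    def complete(self, prompt: str) -> str:
        return "suspicious" if "strcpy" in prompt else "clean"


class ParanoidModel:
    """Toy backend: flags everything."""

    def complete(self, prompt: str) -> str:
        return "suspicious"


class Auditor:
    """Workflow code that stays the same when the model is swapped."""

    def __init__(self, model: ModelBackend) -> None:
        self.model = model

    def triage(self, snippet: str) -> bool:
        verdict = self.model.complete(f"Review this code:\n{snippet}")
        return verdict == "suspicious"


for backend in (KeywordModel(), ParanoidModel()):
    auditor = Auditor(backend)
    print(type(backend).__name__, auditor.triage("return 0;"))
```

Swapping `KeywordModel` for `ParanoidModel` changes the verdicts but not a line of `Auditor`, which is the property the article attributes to the validation and workflow infrastructure around the models.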

The release also prompted discussion about the operational risks of large-scale agentic security systems. In a LinkedIn thread, Sandesh KS wrote:

The orchestration layer is exactly where it gets interesting — and dangerous. When specialized agents start coordinating across identity systems, financial monitoring, and cloud infrastructure simultaneously, the blast radius of a single misconfigured permission boundary becomes enormous. The governance layer has to be designed before the agents go live, not retrofitted after the first incident.

MDASH is currently being tested internally by Microsoft security teams and through a limited private preview with selected customers. The company says organizations interested in testing the system can apply through Microsoft Security’s preview program.

source & further reading

infoq.com — original article


Read original on infoq.com → www.infoq.com/news/2026/05/microsoft-mdash/?utm_…

