Stop overloading your skills

Source: devblogs.microsoft.com · Published: 2026-06-18T14:25Z · Topic: ai-agents

Microsoft warns developers that overloading AI coding agents with redundant documentation wastes tokens and degrades performance. The company advises measuring baseline model knowledge first, then building lean skills that only address gaps where the model fails.

3 min read · published Jun 18, 2026

You built a skill for your technology. API references, authentication flows, SDK patterns, error handling, version info, all packed into one skill. The agent calls it, gets all that context, and generates code. The kicker? You’ve just wasted a lot of tokens.

It already knows

Models have ingested your documentation, your Stack Overflow answers, your GitHub repos, your blog posts. The default imports, the standard auth flow, the common CRUD operations: the model already has all of that baked in.

When your skill repeats what the model already knows, you’re not helping, you’re adding weight. Every token your skill returns occupies space in a finite context window, and those tokens aren’t neutral. They push out the stuff the model doesn’t know: workspace files, conversation history, or output from other tools.

Most skills suffer from the same problem. They are stuffed with thousands of tokens of documentation; the agent calls them, the payload comes back, and outcomes don’t improve. Sometimes outcomes get worse, because the skill is doing work the model didn’t need help with.

How do you know what the model knows?

You don’t, unless you measure. And most folks skip this step entirely. They go straight from “we have a technology” to “we need a skill” without checking what the model does on its own.

What if the model already generates correct code for your API 90% of the time? Then you need a lightweight skill that covers the remaining 10%: the auth quirk that trips people up, the breaking change that’s too recent for training data, the configuration pattern that looks like nothing else on the web. You can’t know which 10% to target if you haven’t measured the baseline.

Measure first, build second

Start by running your scenarios without the skill. Same model, same harness, same prompts. See what the agent gets right and what it gets wrong, because that’s your baseline.
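A baseline run can be as small as a loop over scenarios with no skill attached. The sketch below is one possible shape, not the article's harness: `Scenario`, `run_baseline`, and `fake_agent` are hypothetical names, and the stand-in agent exists only so the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Scenario:
    name: str
    prompt: str
    check: Callable[[str], bool]  # True if the agent output is acceptable


def run_baseline(scenarios, agent, runs=3):
    """Run each scenario with no skill attached; return pass rate per scenario."""
    results = {}
    for s in scenarios:
        passes = sum(1 for _ in range(runs) if s.check(agent(s.prompt)))
        results[s.name] = passes / runs
    return results


# Hypothetical stand-in agent; replace with a call into your real harness.
def fake_agent(prompt: str) -> str:
    return "client.items.create(...)" if "create" in prompt else "TODO"


scenarios = [
    Scenario("crud-create", "create an item", lambda out: "create" in out),
    Scenario("auth-quirk", "rotate the token", lambda out: "rotate" in out),
]

baseline = run_baseline(scenarios, fake_agent)
print(baseline)  # {'crud-create': 1.0, 'auth-quirk': 0.0}
```

Keeping the model, harness, and prompts fixed between the baseline run and any later run with the skill is what makes the two sets of pass rates comparable.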

If the model handles CRUD correctly, don’t put CRUD examples in your skill. If auth flows work out of the box, don’t include your auth guide. If it picks the right SDK version, don’t waste tokens telling it which version to use. What’s left after you subtract the baseline? The patterns the model gets wrong or doesn’t know about at all. That’s your skill’s scope. Nothing more.
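Subtracting the baseline amounts to a filter over measured pass rates. In this minimal sketch the scenario names, the pass rates, and the 0.9 threshold are all invented for illustration; the article does not prescribe a cutoff.

```python
def skill_scope(baseline: dict[str, float], threshold: float = 0.9) -> list[str]:
    """Return the scenarios whose baseline pass rate falls below the threshold."""
    return sorted(name for name, rate in baseline.items() if rate < threshold)


# Hypothetical baseline results (fraction of runs that passed without the skill).
baseline = {
    "crud": 1.0,
    "auth-flow": 0.95,
    "sdk-version": 1.0,
    "new-breaking-change": 0.2,
    "odd-config": 0.4,
}

scope = skill_scope(baseline)
print(scope)  # ['new-breaking-change', 'odd-config']
```

Everything that passes the threshold stays out of the skill; only the two failing scenarios would earn any tokens.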

Every unnecessary token is drag

Context windows have a fixed budget. A skill that returns 3,000 tokens of documentation the model already knows is burning context that could hold the developer’s workspace files, conversation thread, or output from another tool the model needs.

It gets worse when skills compose. Your developers have other skills installed, and each one claims tokens just by being present. Your oversized skill isn’t just dragging its own scenarios, it’s eating into the budget other skills need. You’re not just hurting your outcomes, you’re contributing to everyone else’s drag.
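The budget arithmetic is easy to make concrete. In the sketch below only the 3,000-token figure comes from the text; the 128,000-token window and the other skills' sizes are assumptions chosen for illustration.

```python
CONTEXT_WINDOW = 128_000  # assumed window size, not a figure from the article


def remaining_budget(window: int, skill_tokens: dict[str, int]) -> int:
    """Tokens left for workspace files, conversation, and tool output."""
    return window - sum(skill_tokens.values())


# Hypothetical token cost of each installed skill.
skills = {"your-skill": 3_000, "other-a": 1_200, "other-b": 800}

used = sum(skills.values())
left = remaining_budget(CONTEXT_WINDOW, skills)
share = skills["your-skill"] / used

print(used)   # 5000
print(left)   # 123000
print(share)  # 0.6
```

In this made-up mix, one oversized skill accounts for 60% of everything the skills consume, which is the sense in which it drags on the others.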

The lean skill

Define scenarios: the tasks developers actually ask agents to do with your technology.
Run them without your skill, and score the outcomes.
Identify where the model fails: those failures are your scope.
Build a skill that addresses only the gaps.
Measure again. Confirm you’re producing lift, not drag. Also pay attention to the token count: lift at 3x the tokens is a net loss.
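The final comparison can be sketched as a small verdict function. This is one possible heuristic, not a formula from the article: `Run`, `verdict`, and the sample numbers are hypothetical, and treating a 3x token ratio as a hard cutoff is an assumption that borrows the article's "lift at 3x the tokens" remark.

```python
from dataclasses import dataclass


@dataclass
class Run:
    pass_rate: float   # fraction of scenarios passed
    avg_tokens: float  # average tokens consumed per scenario


def verdict(baseline: Run, with_skill: Run, max_token_ratio: float = 3.0) -> str:
    """Classify a skill as 'lift', 'drag', or 'net loss' against the baseline."""
    lift = with_skill.pass_rate - baseline.pass_rate
    token_ratio = with_skill.avg_tokens / baseline.avg_tokens
    if lift <= 0:
        return "drag"      # no improvement over the model on its own
    if token_ratio >= max_token_ratio:
        return "net loss"  # improvement bought with too many tokens
    return "lift"


base = Run(pass_rate=0.70, avg_tokens=10_000)

print(verdict(base, Run(0.90, 12_000)))  # lift
print(verdict(base, Run(0.90, 30_000)))  # net loss
print(verdict(base, Run(0.65, 14_000)))  # drag
```

A real evaluation would likely weigh lift against cost more gradually, but even a crude gate like this keeps the token count visible next to the pass rate.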

Do this and you’ll end up with skills a fraction of their original size that produce measurably better results. In many cases, models don’t need a textbook. They need a cheat sheet.

source: devblogs.microsoft.com (original article)
