Source: rtrvr.ai

Show HN: DeepSeek Flash inverted the economics of agent products

DeepSeek Flash, an open-source model, has inverted the economics of AI agent products by offering superior performance at a fraction of the cost, reducing API prices by over 100x compared to competitors like Gemini 3 Flash. Developers are switching to DeepSeek to avoid high model lab taxes, with Microsoft even adopting it for Copilot. The shift highlights the advantage of text-only models over multimodal approaches for cost-efficient agent harnesses.

Published Jun 25, 2026 · 1 min read

There is an adversarial relationship between developers and big model labs.

Model labs charged developers higher API prices to subsidize their own agent harness offerings. Think of Anthropic charging 5x higher Claude API prices to subsidize consumer subscriptions. So Cursor, by paying those API prices, was in a way subsidizing its own direct competitor.

DeepSeek V4 Flash has totally inverted this relationship. There is now a model that beats even Sonnet on some benchmarks and is fully open source, and inference providers are racing to the bottom to optimize it and offer cheaper hosting. Every player with a non-SOTA model is racing to swap over and stop paying the big model lab tax; even Microsoft is switching Copilot to DeepSeek.

On switching over to DeepSeek:

- We saw a cost decrease of over 100x, with similar or better performance than Gemini 3 Flash.
- Insane savings from cached input tokens: $0.002 per 1 million tokens.
- Both DeepSeek Flash and GLM 5.2 are text-only models, so clearly multimodal training is not worth the additional cost. Language is just a much more efficient, sparse representation of the world and of reasoning than vision.
- We made an early bet on a text-only web agent harness, and with DeepSeek this now gives us a unique cost advantage.
- We rewrote our harness as a callable DSL library that a model can generate code against and execute. DeepSeek has proven phenomenal at code generation for driving an agent harness.
- I would highly recommend that everyone rewrite their harness to be text-only and callable via executable code.
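To make the "callable DSL" idea concrete, here is a minimal sketch of what such a harness could look like. It is not our actual library: `TextPage`, `Harness`, `run_generated`, and the `read`/`fill`/`click` verbs are all hypothetical names chosen for illustration, and the "model output" is a hard-coded string standing in for code a model would generate.

```python
from dataclasses import dataclass, field


@dataclass
class TextPage:
    """Hypothetical text-only view of a page: visible text plus form state."""
    text: str
    fields: dict = field(default_factory=dict)
    clicked: list = field(default_factory=list)


class Harness:
    """Hypothetical DSL surface: every verb takes and returns plain text."""

    def __init__(self, page: TextPage):
        self.page = page
        self.log = []  # trace of the actions the generated code performed

    def read(self) -> str:
        self.log.append("read")
        return self.page.text

    def fill(self, label: str, value: str) -> str:
        self.page.fields[label] = value
        self.log.append(f"fill {label}")
        return f"filled {label}"

    def click(self, label: str) -> str:
        self.page.clicked.append(label)
        self.log.append(f"click {label}")
        return f"clicked {label}"


def run_generated(code: str, harness: Harness) -> object:
    """Run model-generated code with only the DSL verbs in scope.

    Emptying __builtins__ narrows the namespace but is NOT a security
    sandbox; a real deployment would need process-level isolation.
    """
    scope = {
        "__builtins__": {},
        "read": harness.read,
        "fill": harness.fill,
        "click": harness.click,
    }
    exec(code, scope)
    # By convention in this sketch, the program reports back via `result`.
    return scope.get("result")


# Stand-in for code a model would emit after seeing the DSL description.
GENERATED = '''
text = read()
if "Search" in text:
    fill("query", "deepseek flash pricing")
    click("Search")
result = "submitted"
'''

page = TextPage(text="Search the docs")
harness = Harness(page)
outcome = run_generated(GENERATED, harness)
```

The point of the design is that the model never sees pixels or emits one tool call per round trip. It reads text, writes one short program against a few verbs, and the harness executes the whole program in a single step, which keeps both input and output token counts low.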

Comments URL: [https://news.ycombinator.com/item?id=48680260](https://news.ycombinator.com/item?id=48680260)

Points: 3
