DeepSeek Releases DSpark, a Speculative Decoding Framework That Accelerates DeepSeek-V4 Per-User Generation 60–85% Over MTP-1

wpnews.pro

cd /news/large-language-models/deepseek-releases-dspark-a-speculati… · home › topics › large-language-models › article

[ARTICLE · art-41968] src=marktechpost.com ↗ pub=2026-06-27T16:59Z topic=large-language-models verified=true sentiment=↑ positive

DeepSeek Releases DSpark, a Speculative Decoding Framework That Accelerates DeepSeek-V4 Per-User Generation 60–85% Over MTP-1

DeepSeek open-sourced DSpark, a speculative decoding framework that accelerates per-user generation for DeepSeek-V4 by 57–85% over the MTP-1 baseline. The framework pairs a parallel draft backbone with a Markov head and confidence-scheduled verification to reduce suffix decay and adapt to real-time GPU load. DSpark achieves lossless speedups with accepted length gains of 16–31% over prior methods.

read1 min views1 publishedJun 27, 2026

DeepSeek open-sourced DSpark, a speculative decoding framework that attaches a draft module to existing DeepSeek-V4 weights. It pairs a parallel draft backbone with a lightweight Markov head to cut suffix decay, then adds confidence-scheduled verification that tailors how many tokens get checked to real-time GPU load. Offline, accepted length rises 16–31% over DFlash and Eagle3; in production it speeds per-user generation 57–85% over the MTP-1 baseline, losslessly. The training repo, DeepSpec, ships under MIT.

The post DeepSeek Releases DSpark, a Speculative Decoding Framework That Accelerates DeepSeek-V4 Per-User Generation 60–85% Over MTP-1 appeared first on MarkTechPost.

source & further reading

marktechpost.com — original article Meta’s Astryx Brings a CLI and MCP Server to an Open-Source React Design System Agents Can Read Building Supervised Fine-Tuning Data from NVIDIA Open-SWE-Traces: Trajectory Parsing, Patch Analysis, Token Budgets, and Tool-Use Metrics Cursor Study Finds Reward Hacking Inflates Coding-Agent Benchmark Scores on SWE-bench Pro

~/api · this article 200

$curl api.wpnews.pro/v1/news/deepseek-releases-dspark…

Read original on marktechpost.com → www.marktechpost.com/2026/06/27/deepseek-release…

mentioned entities

DeepSeek

DeepSeek-V4

DSpark

MTP-1

DFlash

Eagle3

DeepSpec

MarkTechPost

metadata

slugdeepseek-releases-dspark-a-speculative-decoding-framework-that-accelerates-v4-60

topic#large-language-models

secondary2 topics

sentimentpositive

canonicalmarktechpost.com

navigation

← prevKorean Workers Vote to Go On Str…

next →Anthropic's Fable 5 could return…

── more in #large-language-models 4 stories · sorted by recency

byteiota.com · 27 Jun · #large-language-models

DeepSeek DSpark Goes Live with 80% Inference Speed Gains

github.com · 27 Jun · #large-language-models

DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

code.intellios.ai · 27 Jun · #large-language-models

Native Hacker News TUI client with AI comments summary written in Golang

dev.to · 27 Jun · #large-language-models

I Cut My OpenAI Bill by 94% Using Chinese AI Models — Here's Exactly How

── more on @deepseek 3 stories trending now

wpnews · 25 May · #artificial-intelligence

Maia-3: free and open source

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Nov · #developer-tools

Custom Zig Test Runner, better ouput, timing display, and support for special "tests:beforeAll" and "tests:afterAll" tests

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required