DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence

wpnews.pro

cd /news/large-language-models/deepseek-v4-towards-highly-efficient… · home › topics › large-language-models › article

[ARTICLE · art-33537] src=arxiv.org ↗ pub=2026-06-19T04:00Z topic=large-language-models verified=true sentiment=↑ positive

DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence

DeepSeek AI released preview versions of its DeepSeek-V4 series, including two Mixture-of-Experts language models with up to 1.6 trillion parameters and support for one-million-token contexts. The models feature architectural innovations like hybrid attention and a new optimizer, achieving state-of-the-art performance while significantly reducing inference costs for long-context tasks.

read1 min views1 publishedJun 19, 2026

arXiv:2606.19348v1 Announce Type: new Abstract: We present a preview version of DeepSeek-V4 series, including two strong Mixture-of-Experts (MoE) language models -- DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated) -- both supporting a context length of one million tokens. DeepSeek-V4 series incorporate several key upgrades in architecture and optimization: (1) a hybrid attention architecture that combines Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) to improve long-context efficiency; (2) Manifold-Constrained Hyper-Connections (mHC) that enhance conventional residual connections; (3) and the Muon optimizer for faster convergence and greater training stability. We pre-train both models on more than 32T diverse and high-quality tokens, followed by a comprehensive post-training pipeline that unlocks and further enhances their capabilities. DeepSeek-V4-Pro-Max, the maximum reasoning effort mode of DeepSeek-V4-Pro, redefines the state-of-the-art for open models, outperforming its predecessors in core tasks. Meanwhile, DeepSeek-V4 series are highly efficient in long-context scenarios. In the one-million-token context setting, DeepSeek-V4-Pro requires only 27% of single-token inference FLOPs and 10% of KV cache compared with DeepSeek-V3.2. This enables us to routinely support one-million-token contexts, thereby making long-horizon tasks and further test-time scaling more feasible. The model checkpoints are available at https://huggingface.co/collections/deepseek-ai/deepseek-v4.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/deepseek-v4-towards-high…

Read original on arxiv.org → arxiv.org/abs/2606.19348

mentioned entities

DeepSeek

DeepSeek-V4-Pro

DeepSeek-V4-Flash

DeepSeek-V4-Pro-Max

DeepSeek-V3.2

Hugging Face

metadata

slugdeepseek-v4-towards-highly-efficient-million-token-context-intelligence

topic#large-language-models

secondary3 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevNewegg deal drops RTX 5060 Ti 16…

next →Stop Saying "It Works on My Mach…

── more in #large-language-models 4 stories · sorted by recency

huggingface.co · 24 Apr · #large-language-models

DeepSeek-V4: a million-token context that agents can actually use

andlukyane.com · 24 Apr · #large-language-models

DeepSeek-V4 Review: Why Million-Token Context Needs Efficient Attention, Not Just Larger Windows

letsdatascience.com · 19 Jun · #large-language-models

AI-skilled Workers Command 56% Wage Premium

flourishlabs.ai · 19 Jun · #large-language-models

Flourish Labs: $500M to reinvent AI using neuroscience [pdf]

── more on @deepseek 3 stories trending now

wpnews · 18 Jun · #large-language-models

ICYMI: ZAI launches GLM-5.2 open model with 1M context

wpnews · 18 Jun · #ai-chips

Apple and Intel join forces in Trump’s push to bring chipmaking home

wpnews · 18 Jun · #ai-agents

How to Automate Business Reports With an AI Agent Instead of Dashboards

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required