High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence

wpnews.pro

cd /news/machine-learning/high-probability-pl-sgd-with-markovi… · home › topics › machine-learning › article

[ARTICLE · art-40286] src=arxiv.org ↗ pub=2026-06-26T04:00Z topic=machine-learning verified=true sentiment=· neutral

High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence

Researchers closed a gap in high-probability bounds for stochastic gradient descent under the Polyak-Łojasiewicz condition with Markovian noise, proving optimal linear dependence on mixing time. They also extended the framework to heavy-tailed gradients with a clipped block method, achieving matching lower bounds. The work tightly characterizes optimal mixing-time and tail-exponent dependencies for PL-SGD.

read1 min views1 publishedJun 26, 2026

arXiv:2606.26316v1 Announce Type: new Abstract: We study first-order methods for smooth objectives satisfying the Polyak-\L{}ojasiewicz (PL) condition when gradient samples are generated by an exogenous Markov chain. In the light-tailed setting, prior uniform-in-time high-probability bounds for ordinary Stochastic Gradient Descent (SGD) under a standard growth envelope scale as $\widetilde{O}(t_{mix}^2/k)$, leaving a gap with the $\widetilde{O}(t_{mix}/k)$ expectation bounds. We close this gap using a lag-blocking argument to establish a uniform high-probability guarantee with a leading stochastic term of $\widetilde{O}(t_{mix}/(k+K_0))$ under geometric mixing. We prove this linear dependence on the mixing time is optimal via a matching $\Omega(\sigma^2 t_{mix}/k)$ lower bound on a quadratic objective driven by a persistent two-state chain. We then extend this framework to heavy-tailed Markovian gradients satisfying a stationary finite-$p$-moment condition, $p \in (1,2]$. We design an all-samples clipped block method that uses every Markov transition while mitigating Markovian bias. Under a transition budget $T$, this algorithm achieves a high-probability stochastic error of $\widetilde{O}(\sigma_p^2(t_{mix}/T)^{2(p-1)/p})$. We establish a matching lower bound by reducing PL optimization to heavy-tailed mean estimation for a sticky Markov chain. Ultimately, this work tightly characterizes the optimal polynomial dependence on mixing time for light-tailed PL-SGD, and the optimal heavy-tail exponent and effective-sample-size dependence in the robust regime.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/high-probability-pl-sgd-…

Read original on arxiv.org → arxiv.org/abs/2606.26316

mentioned entities

arXiv

Polyak-Łojasiewicz

SGD

Markov chain

metadata

slughigh-probability-pl-sgd-with-markovian-noise-optimal-mixing-and-tail-dependence

topic#machine-learning

secondary1 topics

sentimentneutral

canonicalarxiv.org

navigation

← prevHo progettato un'infrastruttura …

next →Cannes Briefing: Creativity is m…

── more in #machine-learning 4 stories · sorted by recency

arxiv.org · 26 Jun · #machine-learning

Forget, Anticipate and Adapt: Test Time Training for Long Videos

arxiv.org · 26 Jun · #machine-learning

Life After Benchmark Saturation: A Case Study of CORE-Bench

arxiv.org · 26 Jun · #machine-learning

Where Larger Models Excel: The Primacy of Constraint-Guided Reasoning

arxiv.org · 26 Jun · #machine-learning

Context Recycling for Long-Horizon LLM Inference

── more on @arxiv 3 stories trending now

wpnews · 19 Oct · #developer-tools

Windows Script to clean up and remove all ASUS software

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Nov · #developer-tools

Custom Zig Test Runner, better ouput, timing display, and support for special "tests:beforeAll" and "tests:afterAll" tests

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required