Neural Network Quantization by Learning Low-Loss Subspaces

wpnews.pro

cd /news/neural-networks/neural-network-quantization-by-learn… · home › topics › neural-networks › article

[ARTICLE · art-38788] src=arxiv.org ↗ pub=2026-06-25T04:00Z topic=neural-networks verified=true sentiment=· neutral

Neural Network Quantization by Learning Low-Loss Subspaces

Researchers propose a novel neural network quantization method that learns quantization-aware linear paths in weight space to find low-loss subspaces, enabling direct quantization of the midpoint without performance degradation. The approach avoids the straight-through estimator and explicit discretization during training, achieving performance comparable to quantization-aware training.

read1 min views1 publishedJun 25, 2026

arXiv:2606.25087v1 Announce Type: new Abstract: Neural network quantization aims to find a discrete representation of parameters that preserves the performance of a full-precision (FP) model as faithfully as possible. Enforcing discrete constraints perturbs parameters away from a well-optimized minimum, generally resulting in performance degradation. Recent studies indicate that low-loss FP solutions are not isolated, but instead belong to connected low-loss subspaces of the loss landscape, where the loss maintains nearly the same minimum value. Models sampled from these subspaces are diverse and retain high accuracy. This raises the question: can a quantized model be constructed to lie within a low-loss subspace of the FP model, thereby automatically preserving performance? We address this question by learning quantization-aware linear paths in weight space optimized to minimize loss. We demonstrate that the midpoint of the resulting subspace is, by design, quantization-friendly and that its direct quantization yields performance comparable to that of quantization-aware training. The proposed procedure offers a novel perspective on weight quantization and, in contrast to conventional methods, neither relies on the straight-through estimator nor involves explicit discretization during training.

source & further reading

arxiv.org — original article

── more in #neural-networks 4 stories · sorted by recency

arxiv.org · 25 Jun · #neural-networks

Graph-Based Phonetic Error Correction of Noisy ASR

arxiv.org · 25 Jun · #neural-networks

SEMIR: Topology-Preserving Graph Minors for Thin-Structure Segmentation

arxiv.org · 25 Jun · #neural-networks

Structuring Sparsity: Block-Sparse Featurizers Capture Visual Concept Manifolds

arxiv.org · 25 Jun · #neural-networks

AgentOdyssey: Open-Ended Long-Horizon Text Game Generation for Test-Time Continual Learning Agents

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required