Pre-Warm: Input-Conditioned Weight Initialization for Convolutional Neural Networks

wpnews.pro

cd /news/machine-learning/pre-warm-input-conditioned-weight-in… · home › topics › machine-learning › article

[ARTICLE · art-38793] src=arxiv.org ↗ pub=2026-06-25T04:00Z topic=machine-learning verified=true sentiment=↑ positive

Pre-Warm: Input-Conditioned Weight Initialization for Convolutional Neural Networks

Researchers introduced Pre-Warm, a zero-training-cost method for data-conditioned initialization of the first convolutional layer in CNNs. The technique clusters patches from a single training batch to initialize half of the first-layer filters, yielding statistically significant accuracy improvements over standard Kaiming initialization across five benchmarks. Pre-Warm adds negligible overhead and requires no architectural changes.

read1 min views1 publishedJun 25, 2026

arXiv:2606.25256v1 Announce Type: new Abstract: We introduce Pre-Warm, a simple yet effective zero-training-cost method for data-conditioned initialization of the first convolutional layer. Before the first forward pass, Pre-Warm extracts mean-centered local patches from a single training batch, clusters them with MiniBatchKMeans, applies inverse Manhattan spatial weighting, and uses the resulting centroids to initialize half of the first-layer filters (the remainder retain Kaiming initialization). We derive closed-form rules for all hyperparameters except a single insensitive scale parameter, though we derive a Kaiming parity bound on scale from patch dimensionality. For grayscale datasets we use Otsu's foreground density; for natural color images we use the mean L2 norm of mean-centered patches. Both rules accurately predict the optimal patch count observed in grid search. Across five standard benchmarks -- MNIST, Fashion-MNIST, CIFAR-10, SVHN, and CIFAR-100 -- and 8-seed paired experiments, Pre-Warm yields statistically significant accuracy improvements over standard Kaiming initialization (p < 0.05 on all datasets, p = 0.0007 on SVHN with 8/8 wins, p = 0.0033 on CIFAR-100 with 7/8 wins). The method adds negligible overhead, requires no architectural changes, and integrates into existing training pipelines with only a few lines of code. Pre-Warm demonstrates that even a lightweight, input-dependent signal can meaningfully improve optimization trajectories in modern convolutional networks.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/pre-warm-input-condition…

Read original on arxiv.org → arxiv.org/abs/2606.25256

mentioned entities

Pre-Warm

MiniBatchKMeans

Kaiming initialization

MNIST

Fashion-MNIST

CIFAR-10

SVHN

CIFAR-100

metadata

slugpre-warm-input-conditioned-weight-initialization-for-convolutional-neural

topic#machine-learning

secondary2 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevChinese models are sometimes bet…

next →Most teams will ship AI-written …

── more in #machine-learning 4 stories · sorted by recency

arxiv.org · 25 Jun · #machine-learning

SEMIR: Topology-Preserving Graph Minors for Thin-Structure Segmentation

arxiv.org · 25 Jun · #machine-learning

Structuring Sparsity: Block-Sparse Featurizers Capture Visual Concept Manifolds

arxiv.org · 24 Jun · #machine-learning

Are Safety Guarantees in Neural Networks Safe? How to Compute Trustworthy Robustness Certifications

arxiv.org · 25 Jun · #machine-learning

Chorus II: Cross-Request Sparsity Reuse for Efficient Image-to-Video Generation

── more on @pre-warm 3 stories trending now

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 24 Jun · #ai-policy

An AI startup is suing the US government for taking away Anthropic's new model

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required