Learning from almost nothing: How neural networks survive heavy input corruption

wpnews.pro

cd /news/machine-learning/learning-from-almost-nothing-how-neu… · home › topics › machine-learning › article

[ARTICLE · art-24757] src=arxiv.org ↗ pub=2026-06-12T04:00Z topic=machine-learning verified=true sentiment=· neutral

Learning from almost nothing: How neural networks survive heavy input corruption

Neural networks maintain well-above-chance accuracy on classification tasks even when over 90% of input data is corrupted, far exceeding human recognition capabilities. Researchers analyzing multi-layer perceptrons under heavy attribute noise found that networks implement a nearest-class-mean prototype rule, assigning test points to the class whose training-set average they most closely resemble. This universal centroid mechanism, derived using a mean-field-inspired approach, explains why learning succeeds even when individual training examples carry almost no signal.

read1 min publishedJun 12, 2026

arXiv:2606.11319v1 Announce Type: new Abstract: Learning from imperfect data is a central theme in machine learning, connecting practical questions of robustness to fundamental questions of learnability. Here we examine attribute noise: learning from corrupted inputs while keeping the labels intact, a setting that has received considerably less analytical attention than its label-noise counterpart. We consider two types of corruption models: additive noise and replacement noise. Through experiments with multi-layer perceptrons (MLPs) on corrupted classification datasets, we find that neural networks remain robust, maintaining well-above-chance accuracy even when inputs are >90% corrupted -- far beyond human recognition. To understand this robustness, we analyze infinite-width networks in the heavy-corruption regime using a mean-field-inspired approach and derive a leading-order decision rule for the classification outcome: the network implements a prototype rule, the nearest-class-mean, assigning each test point to the class whose training-set average it most closely resembles. This leading-order decision rule is universal across a broad range of MLP architectures, holding for any depth, as well as a wide class of activation functions and noise distributions. The same centroid mechanism closely matches finite-width network behavior in our experiments and provides an interpretable and analytically tractable account of why learning can succeed even when individual training examples carry almost no signal.

source & further reading

arxiv.org — original article

── more in #machine-learning 4 stories · sorted by recency

dev.to · 13 Jun · #machine-learning

How Much RAM Do You Really Need to Run LLMs Locally? 2026 Benchmarks

news.ycombinator.com · 13 Jun · #machine-learning

What task are humans still better at than AI?

fabricatedknowledge.com · 13 Jun · #machine-learning

Mythos and Engels' Pause

psychologytoday.com · 13 Jun · #machine-learning

AI in Eating Disorders: Support Tool or Silent Risk?

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required