Feature Lottery? A Bifurcation Theory of Concept Emergence

wpnews.pro

cd /news/machine-learning/feature-lottery-a-bifurcation-theory… · home › topics › machine-learning › article

[ARTICLE · art-14017] src=arxiv.org ↗ pub=2026-05-26T04:00Z topic=machine-learning verified=true sentiment=· neutral

Feature Lottery? A Bifurcation Theory of Concept Emergence

Researchers have developed a bifurcation theory that detects when neural networks acquire structured representations during training in real time, without relying on retrospective or label-dependent metrics. The theory identifies a universal, label-free phase coordinate—a dynamic ratio computed from hidden states—that predicts four distinct transition regimes across diverse settings including language models, self-supervised learning, and grokking. This framework provides an early-warning indicator for training health, detecting the onset of usable structure and representational collapse epochs before downstream metrics react.

read1 min views12 publishedMay 26, 2026

arXiv:2605.24057v1 Announce Type: new Abstract: Neural networks acquire structured representations at specific moments during training, yet identifying these transitions typically relies on retrospective, label-dependent metrics. We introduce a bifurcation theory of representation dynamics to detect these moments in real time. Analyzing a passive GMM probe attached to the evolving encoder, we show the onset of structure corresponds to a supercritical pitchfork bifurcation driven by the loss Hessian. The system exhibits a theoretically predictable zero-crossing ($\beta_c$) that, compared to the network's current state ($\beta$), yields a dynamic ratio $\beta(t)/\beta_c(t)$: a universal, label-free phase coordinate for representation dynamics, computable entirely from hidden states. We empirically validate four distinct transition regimes predicted by this coordinate across diverse settings: SAEs on language models (Pythia), SSL (CIFAR), and grokking (modular arithmetic). Crucially, under finite dissipation, macroscopic symmetry-breaking can lag the initial zero-crossing by orders of magnitude, which providing a rigorous dynamical account of the delayed escape observed in grokking. Microscopically, the bifurcation creates a shared unstable subspace, forcing collective symmetry breaking. We term this the "feature lottery" in SAE training: a feature's terminal interpretability becomes predictable remarkably early. By only 5% of training, early atom purity robustly predicts final convergence purity, with top-decile early atoms achieving over 12x the baseline purity at convergence. Beyond explaining concept emergence, $\beta/\beta_c$ provides a practical early-warning indicator for training health, detecting the onset of usable structure, the crystallization of feature identity, and representational collapse epochs before downstream metrics react.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/feature-lottery-a-bifurc…

Read original on arxiv.org → arxiv.org/abs/2605.24057

mentioned entities

Pythia

CIFAR

metadata

slugfeature-lottery-a-bifurcation-theory-of-concept-emergence

topic#machine-learning

secondary3 topics

sentimentneutral

canonicalarxiv.org

navigation

← prevShow HN: Self-hosted collaborati…

next →Google Enters The Ecommerce Wars…

── more in #machine-learning 4 stories · sorted by recency

arxiv.org · 10 Jul · #machine-learning

Architecture Generalization with MetaNCA

dev.to · 10 Jul · #machine-learning

Large Language Models Demystified: A Visual and Practical Guide

machinebrief.com · 10 Jul · #machine-learning

Predicting the Future: Reducing 5G Coordination Delays with AI

machinebrief.com · 10 Jul · #machine-learning

Cracking the Code: Streamlined Pruning for Large Language Models

── more on @pythia 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 8 Jul · #artificial-intelligence

Anthropic's "J-lens" reveals workspace in Claude mirrors theory of consciousness

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required