Exploring Functional Regimes Inside Small Language Models (Independent Research)

wpnews.pro

cd /news/large-language-models/exploring-functional-regimes-inside-… · home › topics › large-language-models › article

[ARTICLE · art-42853] src=discuss.huggingface.co ↗ pub=2026-06-29T02:23Z topic=large-language-models verified=true sentiment=· neutral

Exploring Functional Regimes Inside Small Language Models (Independent Research)

An independent research project analyzed the internal dynamics of small and medium-sized language models, revealing that functional properties become linearly decodable in hidden representations and that models cluster into two behavioral groups based on dynamic and functional profiles. The findings suggest that functional signal depends more on the geometry of representation space than on specific dimensions, aligning with mechanistic interpretability ideas.

read2 min views1 publishedJun 29, 2026

Hi everyone,

Over the last few months I’ve been working on an independent research project exploring the internal dynamics of small and medium-sized language models.

Rather than evaluating models only by their outputs (benchmarks, perplexity, etc.), I’m trying to characterize how their hidden representations evolve during inference.

The project currently covers 7 open models:

The first part of the framework studies internal trajectories through hidden-state dynamics.

Instead of asking “Which model is more accurate?”, I ask:

This produced several reproducible dynamical fingerprints and architecture clusters.

The second phase moves away from pure dynamics and investigates whether different functional properties become linearly decodable inside hidden representations.

Across multiple probe experiments I observed evidence that:

One interesting observation is that the position of these high-capacity regions varies across architectures rather than appearing at identical absolute depths.

The result that surprised me the most came from a series of control experiments.

After training linear probes I compared:

Gaussian noise and feature permutation substantially reduced decodability.

Orthogonal rotations, however, preserved it almost entirely.

That suggests (at least empirically) that the functional signal depends more on the geometry of the representation space than on specific embedding dimensions.

This seems broadly consistent with ideas discussed in mechanistic interpretability about distributed feature directions.

Across several independent audits, the models repeatedly separate into two broad behavioral groups.

Cluster A

These models consistently exhibit similar dynamic and functional profiles.

Cluster B

Despite architectural differences, these models repeatedly cluster together across multiple analyses.

Seeing the same grouping emerge from different metrics was one of the motivations for continuing the project.

I’m now moving from observation toward causal testing.

The next experiments aim to answer questions such as:

This is entirely independent research, so I’d genuinely appreciate feedback.

I’m especially interested in hearing from people working on:

I’d love to know whether these observations resonate with existing work—or whether there are obvious control experiments I should run next.

source & further reading

discuss.huggingface.co — original article Rakarrack-0.6.1 port making progress! ( AI assisted ) Cloud Storage Poll Welcome to Haiku basic(Haiku Docs, Haiku slide and Haiku sheets)

~/api · this article 200

$curl api.wpnews.pro/v1/news/exploring-functional-reg…

Read original on discuss.huggingface.co → discuss.huggingface.co/t/exploring-functional-re…

metadata

slugexploring-functional-regimes-inside-small-language-models-independent-research

topic#large-language-models

secondary3 topics

sentimentneutral

canonicaldiscuss.huggingface.co

navigation

← prevShow HN: I Made a WebGPU Based A…

next →Seoul bets big on global startup…

── more in #large-language-models 4 stories · sorted by recency

discuss.huggingface.co · 29 Jun · #large-language-models

I analyzed hidden-state dynamics across 7 open-weight LLMs and found recurring functional patterns. Looking for feedback

ianbarber.blog · 29 Jun · #large-language-models

It’s always the learning rates

runtimewire.com · 28 Jun · #large-language-models

Sean Du brings a reasoning-model hallucination detector to ICML 2026

axios.com · 29 Jun · #large-language-models

Anthropic Claude Fable 5, on track to return soon (possibly this week)

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required