cd /news/large-language-models/exploring-functional-regimes-inside-… · home topics large-language-models article
[ARTICLE · art-42853] src=discuss.huggingface.co ↗ pub= topic=large-language-models verified=true sentiment=· neutral

Exploring Functional Regimes Inside Small Language Models (Independent Research)

An independent research project analyzed the internal dynamics of small and medium-sized language models, revealing that functional properties become linearly decodable in hidden representations and that models cluster into two behavioral groups based on dynamic and functional profiles. The findings suggest that functional signal depends more on the geometry of representation space than on specific dimensions, aligning with mechanistic interpretability ideas.

read2 min views1 publishedJun 29, 2026

Hi everyone,

Over the last few months I’ve been working on an independent research project exploring the internal dynamics of small and medium-sized language models.

Rather than evaluating models only by their outputs (benchmarks, perplexity, etc.), I’m trying to characterize how their hidden representations evolve during inference.

The project currently covers 7 open models:

The first part of the framework studies internal trajectories through hidden-state dynamics.

Instead of asking “Which model is more accurate?”, I ask:

This produced several reproducible dynamical fingerprints and architecture clusters.

The second phase moves away from pure dynamics and investigates whether different functional properties become linearly decodable inside hidden representations.

Across multiple probe experiments I observed evidence that:

One interesting observation is that the position of these high-capacity regions varies across architectures rather than appearing at identical absolute depths.

The result that surprised me the most came from a series of control experiments.

After training linear probes I compared:

Gaussian noise and feature permutation substantially reduced decodability.

Orthogonal rotations, however, preserved it almost entirely.

That suggests (at least empirically) that the functional signal depends more on the geometry of the representation space than on specific embedding dimensions.

This seems broadly consistent with ideas discussed in mechanistic interpretability about distributed feature directions.

Across several independent audits, the models repeatedly separate into two broad behavioral groups.

Cluster A

These models consistently exhibit similar dynamic and functional profiles.

Cluster B

Despite architectural differences, these models repeatedly cluster together across multiple analyses.

Seeing the same grouping emerge from different metrics was one of the motivations for continuing the project.

I’m now moving from observation toward causal testing.

The next experiments aim to answer questions such as:

This is entirely independent research, so I’d genuinely appreciate feedback.

I’m especially interested in hearing from people working on:

I’d love to know whether these observations resonate with existing work—or whether there are obvious control experiments I should run next.

── more in #large-language-models 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/exploring-functional…] indexed:0 read:2min 2026-06-29 ·