Reading Calibrated Uncertainty from Language Model Trajectories

wpnews.pro

cd /news/large-language-models/reading-calibrated-uncertainty-from-… · home › topics › large-language-models › article

[ARTICLE · art-13542] src=arxiv.org ↗ pub=2026-05-25T04:00Z topic=large-language-models verified=true sentiment=· neutral

Reading Calibrated Uncertainty from Language Model Trajectories

A new method for measuring uncertainty in language model outputs, developed by researchers and detailed in a preprint on arXiv, extracts geometric features from the model's internal layer-by-layer processing to outperform the standard maximum softmax probability approach. The technique uses a sparse linear probe on scale-invariant features from MLP update trajectories, achieving up to 21 points improvement in AURC under selective abstention. The probe's interpretable coefficients reveal where along the model's depth errors form, including premature commitments and trajectory drift, offering a calibrated alternative for structured output generation.

read1 min views5 publishedMay 25, 2026

arXiv:2605.22864v1 Announce Type: new Abstract: The maximum softmax probability (MSP) represents a default approach when evaluating uncertainty quantification for language model generation with structured output. Although cheap, it is often miscalibrated. Methods that probe the model's internal activations feed raw hidden states into opaque classifiers, reading activations as static snapshots and leaving implicit the layer-wise trajectory by which a representation is formed. Yet, similar endpoints can arise from very different paths, and how evidence accumulates, reinforces, or reverses across depth might reveal uncertainty that final probabilities obscure. We extract eleven scale-invariant geometric features, tracing the cumulative path of per-layer MLP updates, and feed them to a sparse linear probe. The probe outperforms MSP under selective abstention, with gains scaling with baseline miscalibration up to 21 AURC points. Because every feature has a closed-form geometric meaning, the probe's coefficients trace how and where along depth errors take shape -- which layers commit prematurely, which contradict the running state, where trajectories drift away from their endpoint.

source & further reading

arxiv.org — original article

── more in #large-language-models 4 stories · sorted by recency

machinebrief.com · 10 Jul · #large-language-models

Revamping Neural Topology: The Cost of Precision

404media.co · 10 Jul · #large-language-models

AI Fiction Is Easy to Detect Because It's Stupid and Bad, Research Finds

machinebrief.com · 10 Jul · #large-language-models

Why AI Struggles in Real-World Negotiations

machinebrief.com · 10 Jul · #large-language-models

Are Large Language Models Really Getting Smarter?

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required