cd /news/large-language-models/llm-guesses-or-knows · home topics large-language-models article
[ARTICLE · art-26989] src=dev.to ↗ pub= topic=large-language-models verified=true sentiment=· neutral

LLM guesses or knows

A developer tested two large language models to determine when they are guessing versus when they actually know the answer by analyzing logprobs. The Nano model showed miscalibrated confidence, assigning 87% probability to '4' for 'what is 2+2' but also collapsing entropy when copying input, while the Mini model displayed honest uncertainty with no candidate above 31% for a story prompt. The developer concludes that Nano is not recommended for agentic autonomous decision making.

read1 min publishedJun 14, 2026

We wanted to know when LLM is guessing versus when it actually knows the answer.

LLM models expose logprobs - after every word they generate, you can request the top alternative tokens and their probabilities. Low entropy means the model was certain, high means it was guessing.

I tested two models using 0.0 temperature and 3 prompts: "what is 2+2", "the opposite of hot is", "once upon a time." The results:

For the math prompt, Nano gave "4" at 87% and "2" at 13%. For the story prompt, it echoed the input back verbatim, so entropy collapsed because it was copying input, not generating. Mini generated the story with five competing candidates, none above 31%. That's honest uncertainty. Both LLM models are cheap and still popular, but Nano is miscalibrated and not recommended for agentic autonomous decision making.

── more in #large-language-models 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/llm-guesses-or-knows] indexed:0 read:1min 2026-06-14 ·