I built a token-level debugger for comparing two LLMs


Source: dev.to · Published: 2026-05-26T00:14Z · Topic: artificial-intelligence

A developer built Tokenflame, a token-level debugger that compares outputs from two different large language models given the same prompt. The tool generates entropy heatmaps, tokenizer diffs, divergence markers, and token-by-token replay in a single HTML file with one command.

Same prompt, two models, different outputs. No tooling was actually showing me where they diverged.

So I built tokenflame, which provides entropy heatmaps, tokenizer diffs, divergence markers, and token-by-token replay. One command, one HTML file.

pip install tokenflame
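
To make two of those ideas concrete, here is a minimal standard-library sketch of what a per-token entropy value and a divergence marker could look like. It is not tokenflame's API or implementation; the function names `token_entropy` and `first_divergence` and the sample tokens are hypothetical and chosen only for illustration.

```python
import math
from itertools import zip_longest

# Illustrative only: these helpers are hypothetical and are NOT part of tokenflame.

_MISSING = object()  # sentinel marking "this sequence has already ended"


def token_entropy(probs):
    """Shannon entropy (in bits) of one next-token probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def first_divergence(tokens_a, tokens_b):
    """Return the first index where two token sequences differ, or None if identical."""
    pairs = zip_longest(tokens_a, tokens_b, fillvalue=_MISSING)
    for i, (a, b) in enumerate(pairs):
        if a != b:
            return i
    return None


if __name__ == "__main__":
    # Made-up outputs from two models for the same prompt.
    model_a = ["The", " cat", " sat"]
    model_b = ["The", " dog", " sat"]
    print("first divergence at index:", first_divergence(model_a, model_b))
    # A uniform distribution over two candidate tokens carries one bit of entropy.
    print("entropy (bits):", token_entropy([0.5, 0.5]))
```

High entropy at a position means the model was uncertain there, which is what a heatmap would highlight. Comparing token strings directly, as this sketch does, only makes sense when both models use the same tokenizer; otherwise the outputs would need to be aligned on characters or decoded text first.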


Read original on dev.to → dev.to/bn3020/i-built-a-token-level-debugger-for…
