The Decision Subtraction Framework: How to Evaluate Any AI Tool

wpnews.pro

cd /news/ai-tools/the-decision-subtraction-framework-h… · home › topics › ai-tools › article

[ARTICLE · art-16328] src=dev.to ↗ pub=2026-05-28T10:39Z topic=ai-tools verified=true sentiment=· neutral

The Decision Subtraction Framework: How to Evaluate Any AI Tool

A developer has created the Decision Subtraction Framework, a three-lens system for evaluating AI tools based on decisions replaced versus created, time saved, and attention consumed. The framework, which includes formulas for Decision Replacement Ratio, Friction Delta, and Attention ROI, identifies five failure modes including erasure cost and error asymmetry. The developer recommends weighting high-stakes decisions over trivial ones and assessing skill dependency by asking what ability would be lost after six months of use.

read3 min views11 publishedMay 28, 2026

Last week someone asked me which AI tools they should be using. The question hides a problem that costs real money: there are more capable AI tools available than any single person can evaluate.

ChatGPT Plus at $20/month. Claude at $20. Grok at $30. Cursor at $20. Copilot at $10. Each with a $100, $200, or $300 variant underneath. Each claims to earn its place.

The real question is not which tool is best. The real question is: which tools subtract more decisions than they add?

Formula: decisions replaced by the tool ÷ decisions it creates.

List every decision the tool makes for you. Then list every new decision it forces you to make. Divide the first by the second.

Thresholds:

Example: A code completion tool that writes a function body (replaces 5 decisions about syntax, structure, naming) but requires review (adds 2 decisions about correctness) has a ratio of 2.5. It passes.

A meeting summariser that replaces 1 decision (should I re-listen?) but creates 3 (verify accuracy, add context, decide what to share) has a ratio of 0.33. It fails.

Formula: time without the tool ÷ time with the tool.

Include onboarding time amortised over your first 10 uses. A tool that saves 30 minutes per use but took 2 hours to learn breaks even at 4 uses. After that, it is pure gain.

Threshold: Break-even within 5 uses.

Catch: This lens breaks for tools that enable tasks you could not do at all before. A drug discovery simulation has infinite Friction Delta because the alternative is impossible. Score those as "can't evaluate on this lens" and rely on the others.

Formula: output quality ÷ attention consumed.

Estimate cognitive load per use on a simple scale: 1 (fire and forget) to 4 (full attention required). Track whether it goes up or down over 10 uses.

Threshold: Attention per use should decrease over time. If you need to watch it more closely after ten uses than after one, something is wrong.

I tested this framework against the hardest cases I could find. It failed in five ways. Knowing them makes it useful:

Decision quality matters more than quantity. One high-stakes judgment (should I deploy?) outweighs 10 trivial picks (camelCase or snake_case?). Weight strategically.

Friction Delta can't measure capability expansion. If a tool lets you do something new rather than just faster, skip this lens.

Attention ROI rewards deskilling. The descending attention threshold is a Goodhart target — it rewards tools that train you to rubber-stamp.

Erasure cost is invisible. The framework never asks: if I use this for a year, what can I no longer do without it?

Error asymmetry is invisible. Two tools can score identically while producing catastrophically different results when they fail.

Ask: "If I use this tool for six months and then stop, what skill will I have lost?" Score it: 1 (nothing lost) to 4 (core competency outsourced). Score 1-2 is safe. Score 3 is a deliberate trade. Score 4 is dependency, not tooling.

This framework connects to a deeper structural principle: a tool's value is the difficulty it removes. If it creates new difficulty of a different kind, it is not a tool. It is a job.

Full framework with diagram: https://telegra.ph/The-Decision-Subtraction-Framework-How-to-Evaluate-Any-AI-Tool-05-28

source & further reading

dev.to — original article Your embedding axes can move while cosine neighbours stay put DDD & More: Is Software Design Dead in the Age of Automatons? Building Maestro AI: Routing LLM Calls So Your Agent Doesn't Burn Sonnet on Summaries

~/api · this article 200

$curl api.wpnews.pro/v1/news/the-decision-subtraction…

Read original on dev.to → dev.to/harryfloyd/the-decision-subtraction-frame…

mentioned entities

ChatGPT

Claude

Grok

Cursor

Copilot

metadata

slugthe-decision-subtraction-framework-how-to-evaluate-any-ai-tool

topic#ai-tools

secondary4 topics

sentimentneutral

canonicaldev.to

navigation

← prevPixelizing My New Favicon

next →We Deleted 10 Real Users with a …

── more in #ai-tools 4 stories · sorted by recency

dev.to · 12 Jul · #ai-tools

TestSprite's CLI hands your coding agent one clean failure at a time

dev.to · 12 Jul · #ai-tools

Building Maestro AI: Routing LLM Calls So Your Agent Doesn't Burn Sonnet on Summaries

matthodges.com · 12 Jul · #ai-tools

A Magazine with One Subscriber

dev.to · 12 Jul · #ai-tools

Architecting MedReach AI: Solving Data Rot and Regulatory Bottlenecks in Pharma Marketing

── more on @chatgpt 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 21 May · #developer-tools

Antigravity CLI: A Hands-On Guide to Google's Terminal Coding Agent

wpnews · 23 May · #artificial-intelligence

AccessLens — a blind person's lanyard, powered by Gemma 4 on-device

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required