Show HN: Ctx, save tokens by loading only the relevant tools

wpnews.pro

cd /news/developer-tools/show-hn-ctx-save-tokens-by-loading-o… · home › topics › developer-tools › article

[ARTICLE · art-30010] src=github.com ↗ pub=2026-06-16T18:14Z topic=developer-tools verified=true sentiment=↑ positive

Show HN: Ctx, save tokens by loading only the relevant tools

Developer Steve Solun released Ctx, an open-source tool that reduces token costs by selecting only relevant skills, agents, MCP servers, and harnesses before a session begins, rather than compressing data after loading. The tool uses a curated graph of 91k+ skills and other resources to recommend a focused bundle based on repo and task context, aiming to save tokens without manual comparison.

read2 min views25 publishedJun 16, 2026

Hi HN!

Token cost has started to become a high topic of concern to all of us. I tried a few (awesome) tools such as rtk, caveman, and the recent (hillarious but effective) ponytail. What they usually do, is in-line token reduction, e.g. try to compress requests / responses as much as possible.

But then it hit me (and I’m sure others had similar ideas) - just like we have routers that pick the right model, why not have something that will also narrow down the amount of available tools, skills and mcps based on repo/context?

People usually accumulate skills, agents, MCP servers, harnesses, prompts, repo instructions, and local scripts. I’m not saying we are all hoarders, but we sort of are. When did you remove a skill recently? After a while, the model has way too many options to choose from.

ctx tries to fix that by selecting context before the session gets bloated.So no, it doesn’t cleanup your messy garage, but it gives you magic glasses that let you focus only on the tools you need.

It does it by watching the repo and task, walks a graph of available tooling, and recommends a small top-scored bundle of skills, agents, MCP servers, and harnesses.

How does it know? To make sure results are not hallucinated, and repeatable, I curated a list of 91k+ skills, 467 agents, 10.7k MCP servers, 207 harnesses, and built a graph to help ctx make decisions on what to recommend. While I used AI to generate it of course, I curated it and revised it to make sure the data is up to date.

So how this is different from rtk, caveman, ponytail, and similar token-saving tools?

As mentioned above those tools mostly reduce tokens after something is already being used.

rtk compresses command output.

caveman-style tools make the assistant respond with fewer words.

ponytail, is, well, awesome, but again it focuses more on reducing code (YAGNI)

ctx is upstream. It tries to avoid irrelevant skills, agents, MCPs, and harnesses into context at all.

So it is not really a replacement. It should work side by side with them!

Use ctx to choose the right tools. Use rtk to reduce terminal-output noise. Use terse-output tools if you want shorter responses. The goal is simple: save tokens without forcing the user to manually test and compare thousands of possible skills, agents, MCP servers, and harnesses.

Repo: [https://github.com/stevesolun/ctx](https://github.com/stevesolun/ctx)

Comments URL: [https://news.ycombinator.com/item?id=48559559](https://news.ycombinator.com/item?id=48559559)

Points: 2

source & further reading

github.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/show-hn-ctx-save-tokens-…

Read original on github.com → github.com/stevesolun/ctx

mentioned entities

Steve Solun

Ctx

rtk

caveman

ponytail

MCP

GitHub

metadata

slugshow-hn-ctx-save-tokens-by-loading-only-the-relevant-tools

topic#developer-tools

secondary2 topics

sentimentpositive

canonicalgithub.com

navigation

← prevEquinix Expands Cisco + NVIDIA S…

next →California mother, boyfriend ple…

── more in #developer-tools 4 stories · sorted by recency

byteiota.com · 1 Aug · #developer-tools

Microsoft Flint Fixes AI Agent Charts With a Compiler

dev.to · 1 Aug · #developer-tools

Our dev labs open-sourced a local Python middleware framework that intercepts, repairs, and stabilizes malformed AI JSON data streams within local in-memory arrays.

dev.to · 1 Aug · #developer-tools

Optimizing LLM Stream Ingestion: Reconstructing Truncated JSON Payloads in 0.0122ms

dev.to · 1 Aug · #developer-tools

I Stopped My AI From Making Up AWS Answers With One Command

── more on @steve solun 3 stories trending now

wpnews · 30 Jul · #artificial-intelligence

Microsoft and Meta Earnings Show Different AI Spending Pressures

wpnews · 1 Aug · #ai-agents

Quality Isn't Accidental — Maker/Checker Separation and Automated Validation

wpnews · 1 Aug · #developer-tools

I Built a Portable AI Skill That Safely Upgrades .NET Applications

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required