Show HN: Our Claude Code Plugin Routes Lightweight AI Tasks to Specialized SLMs A developer built a Claude Code plugin that routes lightweight AI tasks to specialized small language models (SLMs) to reduce compute costs. The plugin intercepts requests and directs simpler queries to smaller, cheaper models while reserving larger models for complex tasks. This approach can significantly lower inference expenses without sacrificing performance on routine operations. Article URL: https://medium.com/zerogpu/how-to-reduce-ai-compute-costs-with-our-claude-code-plugin-routing-lightweight-ai-tasks-to-small-2a265e19c699 Comments URL: https://news.ycombinator.com/item?id=48574189 Points: 1 Comments: 0