"I Stopped Pretending Every AI Provider Was the Same"

wpnews.pro

cd /news/ai-tools/i-stopped-pretending-every-ai-provid… · home › topics › ai-tools › article

[ARTICLE · art-34822] src=dev.to ↗ pub=2026-06-20T12:10Z topic=ai-tools verified=true sentiment=· neutral

"I Stopped Pretending Every AI Provider Was the Same"

A developer building CliGate, a local control plane for multiple AI coding tools, discovered that treating all AI providers as interchangeable leads to subtle failures. The project shifted from universal passthrough routing to capability-aware routing, where protocol translation is integrated into routing decisions. This approach prevents silent bugs by normalizing fields based on each provider's actual capabilities.

read3 min views1 publishedJun 20, 2026

The easiest way to make an AI gateway feel flaky is to pretend every upstream model works the same way.

On paper, a lot of tools look compatible.

They all take a prompt. They all return text. Some of them even share an OpenAI-shaped API.

In practice, the differences show up exactly where users stop forgiving you:

That was one of the most useful lessons while building CliGate, my local control plane for Claude Code, Codex CLI, Gemini CLI, OpenClaw, a resident assistant, and multiple model/account sources behind one localhost entrypoint.

The bug was subtler than that.

Routing often did succeed. A request got sent somewhere. A response came back. Nothing obviously crashed.

But that did not mean the gateway was correct.

If you route different tools and providers as if they were interchangeable, you get a class of failures that are hard to spot from logs alone:

That is not just transport routing.

That is capability routing.

At first, it is tempting to think routing is just:

pick provider -> send request

That model is too small.

What actually mattered in CliGate was closer to this:

identify caller/tool
-> identify protocol shape
-> resolve provider/model source
-> apply capability profile
-> translate or degrade fields safely
-> send upstream

A provider being reachable is not enough.

It also needs to be treated according to the features it really supports.

One of the more useful internal lessons in this project is that protocol translation is not a separate cleanup step after routing.

It is part of routing.

Some paths can accept a richer request shape. Some need fields normalized or stripped before the request becomes a silent bug.

That changed the safe mental model from:

“upstream did not complain, so the route must be fine.”

to:

“this route supports a specific capability profile, so normalize on purpose.”

That sounds small, but it prevents a lot of “works sometimes” behavior.

This is the trap.

Lots of systems advertise compatibility because they accept a familiar endpoint shape.

But compatibility at the HTTP layer is only the beginning.

If one tool expects richer reasoning or metadata semantics and another backend treats those fields differently, the gateway has three bad choices:

Only the third one scales.

That is why I now prefer capability-aware routing over a universal passthrough design.

claude-code

, codex

, gemini-cli

, openclaw

, and generic OpenAI/Anthropic-compatible clients may hit similar-looking routes, but they are not interchangeable from an operator’s perspective.

The user is often really asking for one of these:

That is why app-aware routing and capability-aware translation ended up being complementary, not separate concerns.

One decides who this request is for.

The other decides how to make it truthful on the way through.

The worst failures are the accidental ones.

If a gateway quietly forwards a field that the destination ignores, the user may never know why results became inconsistent.

So I started preferring explicit degradation rules.

If a route cannot honor a field, normalize it on purpose.

If a provider cannot match a capability, map it honestly.

If a model source is rate-limited or invalid, skip it instead of pretending all active-looking credentials are equal.

That gives me a much better operator story:

A good gateway should hide repetitive setup work.

It should not lie about capability differences.

Once I accepted that, the architecture became cleaner:

That is less magical, but much more dependable.

If I were designing another AI gateway tomorrow, I would keep these rules:

That is the direction I have been pushing with CliGate.

The project still aims to give me one local place for model routing, accounts, API keys, local runtimes, channels, runtime sessions, and an assistant layer.

But the system became much more trustworthy once I stopped pretending every upstream provider was the same.

If you run multiple AI tools through one gateway, are you doing plain endpoint routing, or routing by actual capability too?

source & further reading

dev.to — original article SpaceX AI1 Orbital Data Center: 1 GW of Space AI Compute by 2027, Developer Guide AI for GitLab CI Authoring: Save Hours, Avoid Footguns Elixir 1.20 has a type system now: comparing it with Rust and TypeScript

~/api · this article 200

$curl api.wpnews.pro/v1/news/i-stopped-pretending-eve…

Read original on dev.to → dev.to/codekingai/i-stopped-pretending-every-ai-…

mentioned entities

CliGate

Claude Code

Codex CLI

Gemini CLI

OpenClaw

OpenAI

Anthropic

metadata

slugi-stopped-pretending-every-ai-provider-was-the-same

topic#ai-tools

secondary4 topics

sentimentneutral

canonicaldev.to

navigation

← prevShow HN: Lelu – authorization en…

next →Elixir 1.20 has a type system no…

── more in #ai-tools 4 stories · sorted by recency

dev.to · 20 Jun · #ai-tools

What Is SKILL.md? A Practical Guide to AI Agent Skills

github.com · 20 Jun · #ai-tools

How to sync messages of Claude Code extension in VS Code and Claude Code app?

agenticcodingweekly.com · 20 Jun · #ai-tools

Talking to My Terminal with Local Speech-to-Text and Pi Coding Agent

runtimewire.com · 20 Jun · #ai-tools

Elon Musk takes Grok into Databricks as xAI chases enterprise distribution

── more on @cligate 3 stories trending now

wpnews · 19 Jun · #artificial-intelligence

From Dream Job to 'The Gulag': Inside Staff Revolt Zuckerberg's Brutal AI Push

wpnews · 19 Jun · #artificial-intelligence

Stop Guessing Which Library to Use — I Built an AI Capability Discovery Engine

wpnews · 19 Jun · #large-language-models

I Cut My AI Agent's Token Bill by 62% in One Weekend. Here's the Receipts.

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required