How to switch AI models without rewriting your app

wpnews.pro

cd /news/large-language-models/how-to-switch-ai-models-without-rewr… · home › topics › large-language-models › article

[ARTICLE · art-43166] src=dev.to ↗ pub=2026-06-29T08:47Z topic=large-language-models verified=true sentiment=· neutral

How to switch AI models without rewriting your app

A developer from TokenBay demonstrates how to switch AI models without rewriting application code by using an OpenAI-compatible API gateway. The approach allows developers to keep the familiar OpenAI SDK style while routing requests to different model families through a single endpoint. Changing the base URL and API key enables testing of models like Claude, Gemini, and DeepSeek with minimal code changes.

read3 min views1 publishedJun 29, 2026

Most AI apps start with one model provider.

That is usually the right choice. For a first version, you want one SDK, one API key, one billing page, and one model name. Simple is good when you are trying to ship.

But once the product grows, the model decision gets more complicated.

You may want to test another model because:

The annoying part is that switching models is often not just changing a string.

It can mean adding another SDK, another API key, another request format, another dashboard, and another set of provider-specific edge cases.

That gets messy quickly.

A first version might look like this:

python
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_OPENAI_API_KEY",
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {
            "role": "user",
            "content": "Write a short onboarding message for a developer tool."
        }
    ],
)

print(response.choices[0].message.content)

This is clean and totally fine.

But if you later want to compare Claude, Gemini, DeepSeek, or another model family, you may not want to rewrite your AI integration around each provider.

One practical option is to use an OpenAI-compatible API gateway.

Your app keeps using the OpenAI SDK style, but the gateway lets you route requests to different model families through one endpoint.

I work on the TokenBay team, so the example below uses TokenBay. The general idea applies to any OpenAI-compatible gateway.

python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.tokenbay.com/v1",
    api_key="YOUR_TOKENBAY_API_KEY",
)

response = client.chat.completions.create(
    model="gpt-5.4-mini",
    messages=[
        {
            "role": "user",
            "content": "Write a short onboarding message for a developer tool."
        }
    ],
)

print(response.choices[0].message.content)

The main change is just:

python
base_url="https://api.tokenbay.com/v1"
api_key="YOUR_TOKENBAY_API_KEY"

That is the useful part.

You keep the familiar OpenAI client shape, but you are no longer wiring every provider separately.

Once your app uses an OpenAI-compatible endpoint, testing another supported model can be as simple as changing the model name.

python
response = client.chat.completions.create(
    model="claude-sonnet-4.6",
    messages=[
        {
            "role": "user",
            "content": "Write a short onboarding message for a developer tool."
        }
    ],
)

Or:

python
response = client.chat.completions.create(
    model="gemini-2.5-pro",
    messages=[
        {
            "role": "user",
            "content": "Write a short onboarding message for a developer tool."
        }
    ],
)

The point is not that every model behaves the same.

They do not.

The point is that your business logic should not need to change every time you want to compare models.

For a real app, I would keep the base URL, API key, and model name in environment variables:

bash
LLM_BASE_URL=https://api.tokenbay.com/v1
LLM_API_KEY=YOUR_TOKENBAY_API_KEY
LLM_MODEL=gpt-5.4-mini

Then your application code can stay stable while you test different models.

Change config, redeploy, compare results.

Very boring. Very useful.

This setup is useful if you are:

It does not magically solve model selection. You still need to test output quality, latency, pricing, context length, and reliability.

But it does make the integration layer much simpler.

A gateway is not always the right choice.

Direct provider integration may be better if:

That is a fair tradeoff.

The point is not "always use a gateway."

The point is this:

If you are going to test multiple models anyway, your app should not need a rewrite every time.

TokenBay is an OpenAI-compatible API gateway for accessing models such as GPT, Claude, Gemini, DeepSeek, and others through one endpoint and one API key.

It includes:

If you want to test this pattern, you can try TokenBay here:

[Try TokenBay]https://www.tokenbay.com/?utm_source=devto&utm_medium=community_content&utm_campaign=week1_free_content

Current launch offer:

I would love feedback from builders:

source & further reading

dev.to — original article If You Know CSS, You Already Know Yumma CSS Coding agents need file boundaries, not better manners The cost of learning everyting

~/api · this article 200

$curl api.wpnews.pro/v1/news/how-to-switch-ai-models-…

Read original on dev.to → dev.to/gwenj/how-to-switch-ai-models-without-rew…

mentioned entities

TokenBay

OpenAI

Claude

Gemini

DeepSeek

GPT

metadata

slughow-to-switch-ai-models-without-rewriting-your-app

topic#large-language-models

secondary2 topics

sentimentneutral

canonicaldev.to

navigation

← prevHow Long Until Your AI Edge Stop…

next →Best AI Tools for SaaS Free Tria…

── more in #large-language-models 4 stories · sorted by recency

arxiv.org · 29 Jun · #large-language-models

LLM Medical Triage: Same Symptoms, Gender-Dependent Urgency

dev.to · 29 Jun · #large-language-models

I asked four AI models which database to use in 2026. Neon already won. Four challengers are invisible.

dev.to · 29 Jun · #large-language-models

ARK Trust: The Missing Reliability Layer for AI Agents

dev.to · 29 Jun · #large-language-models

The Loadout Pattern: Handing the Wheel to an Autonomous LLM

── more on @tokenbay 3 stories trending now

wpnews · 28 May · #ai-startups

[AINews] Cognition raises $1B in $26B Series D

wpnews · 5 Jun · #ai-agents

Miasma Worm Targets AI Coding Agents via GitHub Repos

wpnews · 28 Jun · #ai-agents

OpenCode v1.17: Session Snapshots Undo Your AI Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required