Lite-Harness SDK

wpnews.pro

cd /news/ai-tools/lite-harness-sdk · home › topics › ai-tools › article

[ARTICLE · art-39288] src=dev.to ↗ pub=2026-06-25T12:37Z topic=ai-tools verified=true sentiment=↑ positive

Lite-Harness SDK

LiteLLM launched the Lite-Harness SDK, enabling developers to swap between AI agent harnesses such as Claude Code and Codex without rewriting application code. The SDK provides a unified query interface, supports cost controls and logging via the LiteLLM AI Gateway, and allowed one team to save three weeks of integration work while A/B testing different harnesses in production.

read2 min views1 publishedJun 25, 2026

AI harnesses are the new vendor lock-in. To swap across harnesses easily without rewriting your app, LiteLLM launched the Lite-Harness SDK.

Run your prompt across different harnesses:

from lite_harness import query, AgentOptions

prompt = "Fix the failing test"

async for message in query(
    prompt=prompt,
    options=AgentOptions(harness="claude-code", model="claude-opus-4-8"),
):
    print(message)

async for message in query(
    prompt=prompt,
    options=AgentOptions(harness="codex", model="gpt-5.5"),
):
    print(message)

To enable cost controls, fallbacks, and logging, point it to your LiteLLM AI Gateway:

export LITELLM_API_BASE=https://litellm.your-company.com/v1
export LITELLM_API_KEY=sk-litellm-...

Engineer's Takeaway:

This SDK unifies how you invoke the agents, not how they run internally. Each harness keeps its native loop and tool-calling semantics. It is perfect for A/B testing agent performance and centralizing costs, but remember it is in public beta, so custom tool injection might require extra work!

My team was building an internal bot to fix failing CI/CD tests. We had three engineers advocating for three different harnesses: one wanted Claude Code, another Codex, and another Pi AI. Without an abstraction layer, we would have had to maintain three forks of the same bot, with three different SDKs, three logging systems, and three ways to track costs. It would have been an impossible maintenance burden.

The SDK solved that exact pain point in three concrete dimensions:

Instead of maintaining three separate implementations, I had a single query() that routed to whichever harness I wanted. Switching from Claude Code to Codex was literally just changing a string in the options. This allowed us to do real A/B testing in production for two weeks without rewriting any core logic.

By connecting it to the LiteLLM AI Gateway, I could suddenly see on a single dashboard:

Without the gateway, tracking the real cost of an agent (which makes multiple sequential tool calls) is a nightmare of scattered logs.

When Anthropic released new capabilities in Claude Opus 4.8, I just updated the model string. I didn't have to touch the bot's underlying code. That's the real promise of LiteLLM: decoupling your application from the provider.

max_iterations

, an agent can burn $5 in tokens if it gets stuck in an infinite loop. I had to wrap the query()

call in an asyncio.wait_for

with a strict timeout to protect our budget.Lite-Harness probably saved me 3 weeks of integration work and gave me hard data to make an informed architecture decision. We ended up choosing Claude Code as our primary harness and Codex as a fallback for simpler, cost-sensitive tasks.

source & further reading

dev.to — original article AI Dev Weekly #16: Mistral OCR 4, Claude Tag, Alibaba Caught Stealing, GPT-5.6 Delayed The Frontend Is Becoming a Conversation: Where UI Engineering Goes Next I Built a Log Monitoring Script with DeepSeek — Here is What Went Wrong

~/api · this article 200

$curl api.wpnews.pro/v1/news/lite-harness-sdk

Read original on dev.to → dev.to/jeancarlosn/lite-harness-sdk-3f28

mentioned entities

LiteLLM

Claude Code

Codex

Anthropic

Pi AI

metadata

sluglite-harness-sdk

topic#ai-tools

secondary3 topics

sentimentpositive

canonicaldev.to

navigation

← prevFast Netgear WiFi 7 router plumm…

next →The Frontend Is Becoming a Conve…

── more in #ai-tools 4 stories · sorted by recency

polimetro.com · 25 Jun · #ai-tools

What Is Claude Code's Automatic Mode

github.com · 25 Jun · #ai-tools

Show HN: Create animated explainer video from a prompt

promptql.io · 25 Jun · #ai-tools

A Teardown of Claude Tag's Agent Identity Concept

dev.to · 25 Jun · #ai-tools

The 47 Words That Make Your Text Sound AI-Written (And What to Use Instead)

── more on @litellm 3 stories trending now

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 24 Jun · #ai-policy

An AI startup is suing the US government for taking away Anthropic's new model

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required