Sakana Trained One AI to Command GPT-5.5,

wpnews.pro

Source: pub.towardsai.net · Published: 2026-06-24T04:51Z · Topic: artificial-intelligence

A Tokyo lab released an AI model that scored 73.7 on SWE-Bench Pro, outperforming Opus 4.8 (69.2) and GPT-5.5 (58.6).

1 min read · published Jun 24, 2026

Two days ago a Tokyo lab shipped a model that scored 73.7 on SWE-Bench Pro. Opus 4.8 gets 69.2 on the same test. GPT-5.5 gets 58.6. Gemini… Continue reading on Towards AI »

Read original on pub.towardsai.net → pub.towardsai.net/sakana-trained-one-ai-to-comma…
