Sakana Trained One AI to Command GPT-5.5,

A Tokyo lab released an AI model that scored 73.7 on SWE-Bench Pro, ahead of Opus 4.8 (69.2) and GPT-5.5 (58.6).

Two days ago a Tokyo lab shipped a model that scored 73.7 on SWE-Bench Pro. Opus 4.8 gets 69.2 on the same test. GPT-5.5 gets 58.6. Gemini… Continue reading on Towards AI »