cd/entity/SWE-Bench-Pro
grep -l @swe-bench-pro /news/*.json | wc -l → 1

@SWE-Bench-Pro

mentions 1 type Organization
12:43
2026-05-29
marginlab.ai
large-language-models

Claude Code Degraded Before Opus 4.8 Release

Anthropic's Claude Code agent suffered a statistically significant five-day performance degradation immediately before the Opus 4.8 model release, with pass rates dropping from a 65% baseline to as lo…

// co-occurs with top 3 entities