capricorn86

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

19:57

2026-05-27

deepswe.datacurve.ai

ai-agents

DeepSWE Measuring frontier coding agents

DataCurve released DeepSWE, a new benchmark for evaluating frontier coding agents on original, long-horizon software engineering tasks. The benchmark features contamination-free tasks written from scr…

// co-occurs with top 5 entities

DeepSWE 1 DataCurve 1 SWE-bench 1 mini-swe-agent 1 happy-dom 1