04:00
2026-06-04
arxiv.org
ai-agents
Can Generalist Agents Automate Data Curation?
Researchers introduced Curation-Bench, a benchmark testing whether generalist coding agents can automate the labor-intensive process of curating AI training data. Out-of-the-box agents matched strong …