04:00
2026-05-27
arxiv.org
ai-agents
Anchor: Mitigating Artifact Drift in Agent Benchmark Generation
Researchers introduced Anchor, a task-generation pipeline that prevents artifact drift in AI agent benchmarks by formalizing business workflow specifications into constraint optimization programs. Theβ¦