I used autoresearch to improve my AGENTS.md, measured against real tasks
A developer used Codex to iteratively improve an AGENTS.md file across eight versions, measuring each against real pull requests on a five-task training set. The best candidate showed improvements in …