02:59
2026-05-28
arxiv.org
large-language-models
Harness Sensitivity Is Non-Monotone Across LLM Agent Tiers
A new study from researchers testing six large language models across four capability tiers found that the relationship between model capability and optimal harness complexity is non-monotone, contradβ¦