Gemma4:e2B

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

02:59

2026-05-28

arxiv.org

large-language-models

Harness Sensitivity Is Non-Monotone Across LLM Agent Tiers

A new study from researchers testing six large language models across four capability tiers found that the relationship between model capability and optimal harness complexity is non-monotone, contrad…

// co-occurs with top 3 entities

Gemini 2.5 Flash 1 Qwen3.5-122B 1 HEAT-24 1