21:48
2026-06-15
arxiv.org
large-language-models
DPBench: Structural Determinants of Multi-Agent LLM Coordination
Researchers introduced DPBench, a benchmark evaluating coordination in multi-agent LLM systems, finding that protocol structure—not model capability—determines deadlock rates. GPT-5.2 achieved 25% dea…