04:00
2026-06-17
arxiv.org
large-language-models
Can LLMs Be CEOs? Benchmarking Strategic Resource Reallocation with Multi-Role Agent Simulation
Researchers introduced CEO-Bench, a multi-agent benchmark evaluating large language models on strategic resource reallocation across business units. Testing five frontier models revealed high structurβ¦