04:00
2026-05-25
arxiv.org
large-language-models
GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models
Researchers have developed GENSTRAT, a system that uses procedurally generated card games to evaluate strategic reasoning in large language models (LLMs). In a tournament of over 36,000 matches acrossβ¦