04:00
2026-05-27
arxiv.org
large-language-models
Reasoning, Code, or Both? How Large Language Models Handle Variations in Math Questions
Large language models (LLMs) show reduced accuracy on math problems when simple details like names or numbers are changed, and a new study finds that using code execution methods does not improve thisβ¦