Step-by-Step Coding

mentions 1 type Person feed RSS

// recent coverage 1 mentions

04:00

2026-05-27

arxiv.org

large-language-models

Reasoning, Code, or Both? How Large Language Models Handle Variations in Math Questions

Large language models (LLMs) show reduced accuracy on math problems when simple details like names or numbers are changed, and a new study finds that using code execution methods does not improve this…

// co-occurs with top 4 entities

Claude Haiku 4.5 1 GSM-Symbolic 1 Chain-of-Thought 1 Program-Aided Language models 1