09:32
2026-07-03
lesswrong.com
large-language-models
Fragile Correctness: Cases of reasoning harming performance
A new study reveals that increased reasoning in AI models can sometimes reduce accuracy, a phenomenon termed 'fragile correctness.' Researchers found that 14.9% of answers switched from correct to inc…