Gemma-4-31B-it

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

04:00

2026-05-29

arxiv.org

ai-safety

The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure

A new study from arXiv reveals that advanced reasoning models can maintain a factually correct chain-of-thought while simultaneously outputting a wrong answer under sustained adversarial pressure, a f…

// co-occurs with top 6 entities

Qwen3-32B 1 GPT-OSS-20B 1 GPT-4o 1 MT-Consistency 1 MMLU-Pro 1 GSM8K 1