A Primer in Post-Training Reasoning Data: What We Know About How It Works
A new primer synthesizes over 150 studies and system reports on post-training reasoning data for large language models, organizing the field around four key questions: what data objects exist, what ma…