{"type": "article", "title": "Constructing Evaluation Datasets for Procedural Reasoning: Balancing Naturalness, Grounding, and Multi-Hop Coverage", "publisher": "Web Pulse", "url": "https://wpnews.pro/news/constructing-evaluation-datasets-for-procedural-reasoning-balancing-naturalness", "original_source": "https://arxiv.org/abs/2606.12767", "published": "2026-06-12T04:00:00+00:00", "accessed": "2026-06-13", "id": "constructing-evaluation-datasets-for-procedural-reasoning-balancing-naturalness"}