{"type": "article", "title": "Teaching to the Test: Why Reward Models Learn the Dataset, Not the Values", "publisher": "Web Pulse", "url": "https://wpnews.pro/news/teaching-to-the-test-why-reward-models-learn-the-dataset-not-the-values", "original_source": "https://pub.towardsai.net/teaching-to-the-test-why-reward-models-learn-the-dataset-not-the-values-3415281e9e57?source=rss----98111c9905da---4", "published": "2026-06-22T03:58:32+00:00", "accessed": "2026-06-22", "id": "teaching-to-the-test-why-reward-models-learn-the-dataset-not-the-values"}