DoorKey-8x8 — Web Pulse coverage When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL :: https://wpnews.pro/news/when-llm-reward-design-fails-diagnostic-driven-refinement-for-sparse-structured