00:00
2026-06-13
research.rudrite.com
large-language-models
GenPRM: Generative Process Reward Models β interactive visual explainer | Rudrite Research
Zhao et al. published GenPRM, a generative process reward model that reasons and runs code to verify each step, achieving state-of-the-art performance where a 7B parameter model outperforms a 72B paraβ¦