21:11
2026-05-20
vmax.ai
artificial-intelligence
PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play
PopuLoRA is a method for training large language models (LLMs) that uses co-evolving populations of teacher and student adapters to generate and solve verifiable reasoning tasks, such as code and math…