04:00
2026-06-15
arxiv.org
artificial-intelligence
Hybrid Open-Ended Tri-Evolution Makes Better Deep Researcher
Researchers propose the Hybrid Open-Ended Tri-Evolution (HOTE) framework, which uses hybrid-mode reinforcement learning to evolve a proposer, solver, and judge collaboratively for deep research tasks.…