04:00
2026-05-27
arxiv.org
machine-learning
GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training
Researchers have developed GAC, a noise-aware adaptive mixing controller for hybrid post-training that combines supervised fine-tuning and reinforcement learning. The method dynamically adjusts the miโฆ