{"slug": "regressive-plasticity-schedule-a-two-stage-post-training-schedule-for-arc", "title": "Regressive Plasticity Schedule: A Two-Stage Post-Training Schedule for ARC Program Synthesis", "summary": "Researchers introduced Regressive Plasticity Schedule (RPS), a two-stage post-training schedule that couples a learning-rate drop with a curriculum boundary between easier and harder data. Testing on Qwen3-8B with ARC-AGI-1 tasks, RPS improved exact test-output accuracy from 10/419 to 17/419, and on ARC-AGI-2, it increased program-synthesis reliability from 188/240 to 234/240 error-free executions. The findings suggest RPS can enhance general reasoning and stability in program synthesis.", "body_md": "Paper: [GitHub - iamjasonfeng/RPS-Paper · GitHub](https://github.com/iamjasonfeng/RPS-Paper)\n\nThis paper presents Regressive Plasticity Schedule (RPS), a two-stage post-training schedule inspired by developmental plasticity. RPS combines two familiar ideas, curriculum learning and learning-rate reduction, in a specific way: the model is first trained on easier data at a higher learning rate, then trained on harder data at a substantially lower learning rate. This differs from ordinary learning-rate decay because the main intervention is not merely reducing the optimizer step size over time within one training stage; instead, RPS couples a discrete stage-level learning-rate drop to a curriculum boundary between easier and harder data. The broader motivation for RPS is to improve general reasoning: program synthesis is an important testbed here, but the more important question is whether staged plasticity can help models preserve foundational reasoning behaviors while adapting to harder domains. I tested RPS on Qwen3-8B using Alibaba Model Studio managed DPO fine-tuning with LoRA. The control condition, Equal Plasticity Schedule (EPS), used the same model, same two-stage data structure, same within-stage cosine scheduler, and same Stage 1 checkpoint, but did not reduce the Stage 2 learning rate. On ARC-AGI-1 public evaluation, RPS improved exact test-output accuracy from 10/419 to 17/419, which provides evidence of improved ARC-style general reasoning because these tasks require inferring and applying latent transformation rules from few examples. On ARC-AGI-2 public evaluation, neither RPS nor EPS solved any test outputs, but RPS substantially improved program-synthesis reliability: 234/240 attempted programs executed without error for RPS, compared with 188/240 for EPS. The result does not show that RPS solves ARC-AGI-2, but it suggests that a curriculum-coupled plasticity reduction can improve ARC-style reasoning behavior and make a model more stable at producing usable reasoning artifacts. If this pattern generalizes beyond ARC, RPS could have large potential as a simple post-training schedule for improving broader reasoning behavior.", "url": "https://wpnews.pro/news/regressive-plasticity-schedule-a-two-stage-post-training-schedule-for-arc", "canonical_source": "https://discuss.huggingface.co/t/regressive-plasticity-schedule-a-two-stage-post-training-schedule-for-arc-program-synthesis/176922#post_1", "published_at": "2026-06-18 02:53:22+00:00", "updated_at": "2026-06-18 02:58:28.502662+00:00", "lang": "en", "topics": ["machine-learning", "large-language-models", "artificial-intelligence", "ai-research"], "entities": ["Qwen3-8B", "Alibaba Model Studio", "ARC-AGI-1", "ARC-AGI-2", "Regressive Plasticity Schedule", "Equal Plasticity Schedule"], "alternates": {"html": "https://wpnews.pro/news/regressive-plasticity-schedule-a-two-stage-post-training-schedule-for-arc", "markdown": "https://wpnews.pro/news/regressive-plasticity-schedule-a-two-stage-post-training-schedule-for-arc.md", "text": "https://wpnews.pro/news/regressive-plasticity-schedule-a-two-stage-post-training-schedule-for-arc.txt", "jsonld": "https://wpnews.pro/news/regressive-plasticity-schedule-a-two-stage-post-training-schedule-for-arc.jsonld"}}