cd /news/machine-learning/regressive-plasticity-schedule-a-two… · home topics machine-learning article
[ARTICLE · art-32000] src=discuss.huggingface.co ↗ pub= topic=machine-learning verified=true sentiment=↑ positive

Regressive Plasticity Schedule: A Two-Stage Post-Training Schedule for ARC Program Synthesis

Researchers introduced Regressive Plasticity Schedule (RPS), a two-stage post-training schedule that couples a learning-rate drop with a curriculum boundary between easier and harder data. Testing on Qwen3-8B with ARC-AGI-1 tasks, RPS improved exact test-output accuracy from 10/419 to 17/419, and on ARC-AGI-2, it increased program-synthesis reliability from 188/240 to 234/240 error-free executions. The findings suggest RPS can enhance general reasoning and stability in program synthesis.

read2 min views1 publishedJun 18, 2026

Paper: GitHub - iamjasonfeng/RPS-Paper · GitHub This paper presents Regressive Plasticity Schedule (RPS), a two-stage post-training schedule inspired by developmental plasticity. RPS combines two familiar ideas, curriculum learning and learning-rate reduction, in a specific way: the model is first trained on easier data at a higher learning rate, then trained on harder data at a substantially lower learning rate. This differs from ordinary learning-rate decay because the main intervention is not merely reducing the optimizer step size over time within one training stage; instead, RPS couples a discrete stage-level learning-rate drop to a curriculum boundary between easier and harder data. The broader motivation for RPS is to improve general reasoning: program synthesis is an important testbed here, but the more important question is whether staged plasticity can help models preserve foundational reasoning behaviors while adapting to harder domains. I tested RPS on Qwen3-8B using Alibaba Model Studio managed DPO fine-tuning with LoRA. The control condition, Equal Plasticity Schedule (EPS), used the same model, same two-stage data structure, same within-stage cosine scheduler, and same Stage 1 checkpoint, but did not reduce the Stage 2 learning rate. On ARC-AGI-1 public evaluation, RPS improved exact test-output accuracy from 10/419 to 17/419, which provides evidence of improved ARC-style general reasoning because these tasks require inferring and applying latent transformation rules from few examples. On ARC-AGI-2 public evaluation, neither RPS nor EPS solved any test outputs, but RPS substantially improved program-synthesis reliability: 234/240 attempted programs executed without error for RPS, compared with 188/240 for EPS. The result does not show that RPS solves ARC-AGI-2, but it suggests that a curriculum-coupled plasticity reduction can improve ARC-style reasoning behavior and make a model more stable at producing usable reasoning artifacts. If this pattern generalizes beyond ARC, RPS could have large potential as a simple post-training schedule for improving broader reasoning behavior.

── more in #machine-learning 4 stories · sorted by recency
── more on @qwen3-8b 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/regressive-plasticit…] indexed:0 read:2min 2026-06-18 ·