Regressive Plasticity Schedule: A Two-Stage Post-Training Schedule for ARC Program Synthesis


Source: discuss.huggingface.co · Published: 2026-06-18T02:53Z · Topic: machine-learning


The article introduces the Regressive Plasticity Schedule (RPS), a two-stage post-training schedule that couples a learning-rate drop with a curriculum boundary between easier and harder data. In tests on Qwen3-8B, RPS reached 17/419 exact test outputs on the ARC-AGI-1 public evaluation, versus 10/419 for an equal-learning-rate control. On ARC-AGI-2, neither condition solved any test output, but 234/240 RPS programs executed without error, versus 188/240 for the control. The findings suggest that RPS can improve ARC-style reasoning and the stability of program synthesis.


Paper: iamjasonfeng/RPS-Paper on GitHub

This paper presents Regressive Plasticity Schedule (RPS), a two-stage post-training schedule inspired by developmental plasticity. RPS combines two familiar ideas, curriculum learning and learning-rate reduction, in a specific way: the model is first trained on easier data at a higher learning rate, then trained on harder data at a substantially lower learning rate. This differs from ordinary learning-rate decay because the main intervention is not a gradual reduction of the optimizer step size within one training stage. Instead, RPS couples a discrete, stage-level learning-rate drop to a curriculum boundary between easier and harder data.

The broader motivation for RPS is to improve general reasoning. Program synthesis is an important testbed, but the more important question is whether staged plasticity can help models preserve foundational reasoning behaviors while adapting to harder domains.

I tested RPS on Qwen3-8B using Alibaba Model Studio managed DPO fine-tuning with LoRA. The control condition, Equal Plasticity Schedule (EPS), used the same model, the same two-stage data structure, the same within-stage cosine scheduler, and the same Stage 1 checkpoint, but did not reduce the Stage 2 learning rate.

On the ARC-AGI-1 public evaluation, exact test-output accuracy was 17/419 (about 4.1%) for RPS, compared with 10/419 (about 2.4%) for EPS. This provides evidence of improved ARC-style general reasoning, because these tasks require inferring and applying latent transformation rules from few examples. On the ARC-AGI-2 public evaluation, neither RPS nor EPS solved any test outputs, but RPS substantially improved program-synthesis reliability: 234/240 attempted programs (97.5%) executed without error for RPS, compared with 188/240 (about 78.3%) for EPS.

The result does not show that RPS solves ARC-AGI-2, but it suggests that a curriculum-coupled plasticity reduction can improve ARC-style reasoning behavior and make a model more stable at producing usable reasoning artifacts. If this pattern generalizes beyond ARC, RPS could be useful as a simple post-training schedule for improving broader reasoning behavior.
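
To make the difference between the two conditions concrete, here is a minimal sketch of the learning-rate curves, not the training setup used in the paper. The step counts, the Stage 1 peak learning rate, and the 10x drop (`RPS_DROP`) are hypothetical values chosen for illustration, since the post does not report them. The sketch also assumes that the cosine curve restarts at the stage boundary, decays to zero, and has no warmup.

```python
import math

# Hypothetical values for illustration only; the source does not report
# step counts or learning rates.
STAGE1_STEPS = 100      # steps on the easier data
STAGE2_STEPS = 100      # steps on the harder data
STAGE1_PEAK_LR = 1e-4   # Stage 1 peak learning rate
RPS_DROP = 0.1          # assumed Stage 2 / Stage 1 peak ratio under RPS


def cosine_lr(step: int, total_steps: int, peak_lr: float) -> float:
    """Cosine decay from peak_lr to 0 across one stage (no warmup assumed)."""
    progress = step / max(1, total_steps - 1)
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * progress))


def two_stage_lr(global_step: int, stage2_peak_lr: float) -> float:
    """Learning rate at a global step; the cosine curve restarts at the boundary."""
    if global_step < STAGE1_STEPS:
        return cosine_lr(global_step, STAGE1_STEPS, STAGE1_PEAK_LR)
    return cosine_lr(global_step - STAGE1_STEPS, STAGE2_STEPS, stage2_peak_lr)


def rps_lr(global_step: int) -> float:
    """RPS: Stage 2 (harder data) starts from a reduced peak."""
    return two_stage_lr(global_step, STAGE1_PEAK_LR * RPS_DROP)


def eps_lr(global_step: int) -> float:
    """EPS control: Stage 2 reuses the Stage 1 peak."""
    return two_stage_lr(global_step, STAGE1_PEAK_LR)


if __name__ == "__main__":
    for step in (0, 50, 99, 100, 150, 199):
        print(f"step {step:3d}  RPS {rps_lr(step):.2e}  EPS {eps_lr(step):.2e}")
```

In this sketch the two schedules are identical throughout Stage 1, which mirrors the shared Stage 1 checkpoint, and they differ only in the peak that Stage 2 starts from. That single discrete drop at the curriculum boundary is the intervention being compared.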



