grep -l @acrobot-v1 /news/*.json | wc -l → 1

@Acrobot-v1

mentions: 1 · type: Organization · feed: RSS
04:00
2026-05-26
arxiv.org
machine-learning

Not All Transitions Matter: Evidence from PPO

Researchers found that removing 25% of transitions from reinforcement learning rollout data stabilizes PPO training by breaking repetitive gradient structures caused by causally chained states. The me…
