cd/entity/AIME 2024ยท homeโ€บ entitiesโ€บ AIME 2024
grep -l @aime 2024 /news/*.json | wc -l โ†’ 1

@AIME 2024

mentions 1 type Person feed RSS
00:00
2026-04-20
andlukyane.com
large-language-models

FIPO: Teaching LLMs Which Thoughts Actually Matter

FIPO (Future-Impact-based Policy Optimization) is a reinforcement learning method that improves LLM reasoning by assigning token-level credit based on each token's future impact on the policy, rather โ€ฆ

// co-occurs with top 4 entities