ScienceWorld

mentions 5 type Organization feed RSS

// recent coverage 5 mentions

04:00

2026-07-21

arxiv.org

artificial-intelligence

Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL

Researchers introduce masked diffusion language models (MDLMs) as steerable text-based world models for reinforcement learning, achieving up to 47% absolute gains over baselines in zero-shot transfer …

04:37

2026-07-16

machinebrief.com

artificial-intelligence

A New Path for AI: Overcoming the Pitfalls of Long-Horizon Tasks

A new framework called Experience Memory Graph (EMG) offers a promising solution to the persistent challenge of AI agents failing in complex, long-horizon tasks by treating error recovery as a graph m…

20:25

2026-07-10

machinebrief.com

artificial-intelligence

Exploring New AI Pathways: TREK's Innovative Approach

Researchers introduced TREK (Teacher-Routed Exploration via Forward KL), a new AI training method that enhances learning through unconventional exploration strategies. TREK significantly improved perf…

20:24

2026-07-10

machinebrief.com

artificial-intelligence

TREK: A New Path in AI Problem Solving

Researchers introduced TREK, a method that improves AI problem-solving by using verified output trajectories to extend model learning. TREK boosted Qwen3-8B's performance on AIME 2024 from 36.9 to 40.…

04:00

2026-06-05

arxiv.org

machine-learning

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

Researchers have developed CVT-RL, a reinforcement learning algorithm that uses policy-conditioned counterfactual credit assignment to reduce unsupported evidence chains and shortcut actions in long-h…

// co-occurs with top 8 entities

ALFWorld 5 Qwen3 3 TREK 2 CVT-RL 1 PCCC 1 AIME 1 Group Relative Policy Optimization 1 AIME 2024 1

// topics top 6 topics

machine learning 4 artificial intelligence 4 large language models 3 ai agents 3 ai research 3 ai safety 1