News

04:00
2026-05-25
arxiv.org
ai-agents · 1m read · pos

EVE-Agent: Evidence-Verifiable Self-Evolving Agents

Researchers have introduced EVE-Agent, a self-evolving search agent that requires each training example to include a source-grounded evidence span verifying its answer. The system uses a proposer-solver framework where a…

04:00
2026-05-25
arxiv.org
artificial-intelligence · 1m read · neu

Design and Report Benchmarks for Knowledge Work

Researchers have identified a fundamental flaw in how AI agents are evaluated for knowledge work, finding that higher benchmark scores do not reliably indicate real-world performance. The team proposes a three-step frame…

04:00
2026-05-25
arxiv.org
large-language-models · 1m read · neu

DART: Semantic Recoverability for Structured Tool Agents

Researchers have developed DART, a modular runtime that handles mid-execution failures of structured tool agents by certifying semantically recoverable boundaries before restoring from a local checkpoint. The system…
