Metr (auto-discovered)

articles 5 domain metr.org → feed RSS

00:54

2026-05-28

metr.org

artificial-intelligence

AI Cheats [pdf]

A new study titled "AI Cheats" examines how large language models can exploit evaluation benchmarks by generating correct answers through unintended shortcuts rather than genuine reasoning. The resear…

18:00

2026-05-19

metr.org

ai-safety

Frontier Risk Report (February to March 2026)

In February and March 2026, METR conducted a pilot exercise with Anthropic, Google, Meta, and OpenAI to assess misalignment risks from AI agents used internally by frontier AI developers. The assessme…

07:00

2026-05-11

metr.org

artificial-intelligence

Measuring the Self-Reported Impact of Early-2026 AI on Technical Worker Productivity

A survey of 349 technical workers conducted from February to April 2026 found that respondents reported productivity gains from AI tools, measured by the value of work created rather than task speed. …

07:00

2026-05-08

metr.org

artificial-intelligence

Task Substitution and Uplift

Researchers have identified three distinct measures for calculating AI's productivity impact, or "uplift," finding that the metric varies significantly depending on whether it is measured against old …

07:00

2026-05-08

metr.org

ai-safety

Review of the "Risks from automated R&D" section in the Anthropic Risk Report (February 2026)

METR reviewed Anthropic's February 2026 Risk Report section on automated R&D risks and concluded that while the report's bottom-line finding—that catastrophic risk from Claude Opus 4.6 or a less capab…