Inspect

mentions 3 type Organization feed RSS

// recent coverage 3 mentions

00:21

2026-07-25

lesswrong.com

ai-safety

Orbit: A framework for multi-agent security evaluations

The Cooperative AI Foundation and MATS program released v0 of Orbit, a framework for multi-agent safety and security evaluations built on Inspect, designed to address risks from miscoordination, confl…

20:40

2026-07-23

evalevalai.com

artificial-intelligence

One Schema for Every Eval

The EvalEval Coalition has launched a unified, open data format and public dataset for AI evaluation results, aiming to address fragmentation and enable trust and comparability across frameworks. The …

17:58

2026-06-17

lesswrong.com

ai-safety

Porting MACHIAVELLI To Inspect

A developer ported the MACHIAVELLI benchmark, which measures unethical AI agent behavior, to the Inspect evaluation framework to make it easier for evaluators to use. The re-implementation is now offi…

// co-occurs with top 8 entities

MACHIAVELLI 1 Apollo Research 1 Claude 1 Opus 1 Sonnet 1 Qwen 1 GPT-4 1 EvalEval Coalition 1

// topics top 6 topics

ai research 3 ai safety 2 ai agents 2 ai tools 2 ai ethics 1 artificial intelligence 1