17:58
2026-06-17
lesswrong.com
ai-safety
Porting MACHIAVELLI To Inspect
A developer ported the MACHIAVELLI benchmark, which measures unethical AI agent behavior, to the Inspect evaluation framework to make it easier for evaluators to use. The re-implementation is now offi…