Monika J.

mentions: 1 | type: Person | feed: RSS

// recent coverage: 1 mention

18:43

2026-06-21

lesswrong.com

ai-safety

Introducing MonitoringBench

Researchers released MonitoringBench, a benchmark of 2,644 attack trajectories for evaluating coding-agent monitors, along with a semi-automated red-teaming pipeline. The pipeline decomposes attack co…

// co-occurs with top 7 entities

MonitoringBench (1), Opus 4.5 (1), GPT-5 (1), BashArena (1), Ma-Martinez (1), Ollie (1), Tyler Tracy (1)