15:26
2026-05-29
dev.to
ai-safety
The UK Government Just Merged This Open-Source AI Security Benchmark Into Their National Evaluation Framework
The UK Government's AI Safety Institute has merged the open-source AgentThreatBench benchmark into its official inspect_evals framework, which is used to evaluate frontier AI models from OpenAI, Anthr…