grep -l @cyscenariobench /news/*.json | wc -l → 1

CyScenarioBench

mentions: 1 · type: Organization

// recent coverage (1 mention)

2026-06-26 22:37 · irregular.com · ai-safety

Assessing GPT-5.6 Sol Against Cybersecurity Benchmarks

Irregular, in collaboration with OpenAI, tested GPT-5.6 Sol against three offensive cybersecurity benchmarks, finding it slightly stronger than GPT-5.5. The model solved 19 of 197 FrontierCyber challe…

// co-occurs with top 5 entities