04:15
2026-06-23
swelljoe.com
ai-safety
Will It Mythos?
A developer created a benchmark to test whether AI models besides Anthropic's Mythos can find security bugs as effectively, using bugs Mythos discovered. The benchmark currently includes nine bugs fro…