Against the METR Graph
AI researcher Nathan Witkin has challenged the validity of METR's widely-cited Long Tasks benchmark, arguing its methodology is fundamentally flawed despite its status as a leading indicator of AI cap…
President Trump abruptly canceled a planned executive order that would have established voluntary pre-deployment safety evaluations for frontier AI models, hours before the scheduled signing ceremony.…
At a Berkeley conference on AI control, attendees participated in a roleplaying game where each player acted as a double-dealing AI agent with a secret side task, while also monitoring others for susp…
OpenAI has reversed its position on a controversial Illinois AI bill, disavowing the liability shield provision it previously endorsed and instead backing a stronger safety bill that requires third-pa…
U.S. Rep. Val Hoyle (D-OR) initially distanced herself from the pro-artificial intelligence super PAC Leading the Future after it endorsed her, stating she did not seek the endorsement and criticizing…
Silicon Valley executives have successfully promoted the narrative of an AI race with China to advance their policy agenda in Washington, according to a forthcoming academic paper. The tech industry's…
The White House has asked Anthropic to halt further expansion of its Mythos AI model due to national security concerns about its cyber capabilities, marking the first instance of direct government con…
Google signed a classified deal Monday allowing the Pentagon to use its AI models for "any lawful governmental purpose," surprising its own researchers who had been assured by senior management that t…
A Molotov cocktail was thrown at OpenAI CEO Sam Altman's San Francisco mansion in April, and a 20-year-old man was arrested after allegedly attempting to break into OpenAI's headquarters with a manife…