17:02
2026-05-29
lesswrong.com
ai-safety
Retrying vs Resampling in AI Control
Researchers at BashControl released a new paper revisiting AI control protocols, comparing resampling strategies from their earlier Ctrl-Z study against retrying protocols similar to those used in Claβ¦