18:42
2026-06-18
lesswrong.com
ai-safety
On “Model Organisms”
A researcher at Arcadia Impact's Alignment Team draws parallels between model organisms in biology and AI safety research, arguing that studying specific language models can reveal general principles …