Easy solution to slow down recursive AI self improvement:
- The lab with the top-ranked model must agree THEY must not use it for working on frontier AI
- But everyone else should have access to it. By definition, this means the frontier doesn't advance.
It also has the critical benefit of avoiding a dangerous power imbalance.
Anthropic has chosen the
oppositeof the safe path: they are allowing themselves, the current top lab, to use their top model for frontier AI research. They've said they'll sabotage others who try.This means the AI frontier advances, & power imbalance increases.
(To be clear,
Idon't think we should try to slow down recursive AI self improvement - I think we should open it up and democratize it as much as possible. My point is: ifyouclaim we should slow down, and you have the best model, you should ensure your org can't use it.)
— Jeremy Howard, in a Twitter thread
Tags: ai-ethics, anthropic, generative-ai, claude-mythos, jeremy-howard, ai, llms