16:06
2026-06-25
lesswrong.com
large-language-models
Exploration: fine-tuning with parameter decomposition
Researchers at Goodfire demonstrated that fine-tuning a single scalar prefactor on a German-related rank-1 parameter subcomponent of a 67M-parameter language model can destroy its ability to predict G…
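
To make the idea of "a single scalar prefactor on a rank-1 parameter subcomponent" concrete, here is a minimal toy sketch in plain Python. It is not Goodfire's code or method: the 2x2 weights, the decomposition `W = W_rest + alpha * u v^T`, the squared-error loss, and every name (`effective_weight`, `tune_alpha`, `W_rest`, `u`, `v`) are assumptions chosen only for illustration. All weights are frozen except the scalar `alpha`, which is trained by gradient descent.

```python
def outer(u, v):
    """Rank-1 matrix u v^T as a list of rows."""
    return [[ui * vj for vj in v] for ui in u]


def matvec(W, x):
    """Matrix-vector product W x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def effective_weight(W_rest, u, v, alpha):
    """W_rest + alpha * u v^T, where alpha is the scalar prefactor."""
    uv = outer(u, v)
    return [[r + alpha * c for r, c in zip(r_row, c_row)]
            for r_row, c_row in zip(W_rest, uv)]


def tune_alpha(W_rest, u, v, x, target, alpha=1.0, lr=0.1, steps=200):
    """Gradient descent on 0.5 * ||W(alpha) x - target||^2 w.r.t. alpha only.

    W_rest, u and v stay frozen; alpha is the single trainable parameter.
    """
    vx = sum(vi * xi for vi, xi in zip(v, x))  # v . x
    for _ in range(steps):
        y = matvec(effective_weight(W_rest, u, v, alpha), x)
        # dL/dalpha = ((y - target) . u) * (v . x)
        grad = sum((yi - ti) * ui for yi, ti, ui in zip(y, target, u)) * vx
        alpha -= lr * grad
    return alpha


# Hypothetical toy setup: the rank-1 component reads direction v = [0, 1]
# and writes direction u = [1, 0].
W_rest = [[1.0, 0.0], [0.0, 1.0]]
u = [1.0, 0.0]
v = [0.0, 1.0]

x_targeted = [0.0, 1.0]   # input the component responds to (v . x = 1)
x_other = [1.0, 0.0]      # input the component ignores (v . x = 0)

# Train alpha so the component's contribution on x_targeted vanishes.
target = matvec(W_rest, x_targeted)
alpha_tuned = tune_alpha(W_rest, u, v, x_targeted, target)

W_before = effective_weight(W_rest, u, v, 1.0)
W_after = effective_weight(W_rest, u, v, alpha_tuned)

out_targeted_before = matvec(W_before, x_targeted)
out_targeted_after = matvec(W_after, x_targeted)
out_other_before = matvec(W_before, x_other)
out_other_after = matvec(W_after, x_other)
```

In this toy, driving `alpha` toward zero changes the output only for inputs with a nonzero projection onto `v`; inputs orthogonal to `v` are untouched. That is the intuition behind editing one behaviour through one scalar, under the assumption that the decomposition isolates the behaviour in a single rank-1 component.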