15:53
2026-05-29
lesswrong.com
machine-learning
When Are Two Networks the Same?
Researchers have developed a tensor similarity method that can detect changes in neural network behavior, such as backdoor attacks, by comparing the weight-space structure of models rather than just tโฆ