04:00
2026-06-19
arxiv.org
machine-learning
How Linear Is a Transformer Feed-Forward Block? Per-Block Linear Recoverability Is Learned, Not Architectural
A new study measures the linearity of transformer feed-forward blocks, finding that linear recoverability (Rยฒ_lin) varies widely across blocks and is a learned property, not an architectural one. The โฆ