06:34
2026-06-18
arxiv.org
machine-learning
Explaining Attention with Program Synthesis
Researchers at an undisclosed institution developed a method to replace attention heads in transformer language models with human-readable Python programs, achieving over 75% similarity in attention pโฆ