04:00
2026-06-19
arxiv.org
large-language-models
Pruning via Causal Attribution Preserves Reasoning Performance in Large Language Models
Researchers introduced Causal Attribution Pruning (CAP), a training-free method that identifies critical attention heads in large language models by measuring their causal impact on reasoning tasks. Cโฆ