PIQA

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

04:00

2026-06-29

arxiv.org

large-language-models

Prism Transformer: Progressive Head Schedules for Hierarchical Attention Processing

Researchers introduce the Prism Transformer, a new architecture that progressively increases head counts across layers to create a hierarchical attention processing structure. This design improves per…

// co-occurs with top 5 entities

Prism Transformer 1 arXiv 1 HellaSwag 1 ARC-Easy 1 WinoGrande 1