04:00
2026-06-15
arxiv.org
large-language-models
SuperThoughts: Reasoning Tokens in Superposition
Researchers propose SuperThoughts, a method that compresses pairs of consecutive Chain-of-Thought tokens into single latent representations and decodes two tokens per step, doubling inference throughp…