LoRi: Low-Rank Distillation for Implicit Reasoning
Researchers have developed LoRi, a low-rank distillation framework that improves implicit reasoning in large language models by aligning teacher and student reasoning trajectories within a shared low-…