04:00
2026-05-27
arxiv.org
machine-learning
InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization
Researchers have developed InfoQuant, a training-free method that reshapes activation distributions in large language models to improve low-bit quantization efficiency. The approach, which uses Peak Sโฆ