18:30
2026-06-26
research.google
large-language-models
Accelerating Gemini Nano models on Pixel with frozen Multi-Token Prediction
Google announced a method to retrofit Multi-Token Prediction onto frozen Gemini Nano v3 models, accelerating on-device inference for Pixel 9 and 10 series devices. The approach appends a lightweight Tβ¦