SMs — Web Pulse coverage Speculative decoding: when and why it actually speeds up inference :: https://wpnews.pro/news/speculative-decoding-when-and-why-it-actually-speeds-up-inference