grep -l @deepgemm /news/*.json | wc -l → 1

DeepGEMM

mentions: 1 | type: Organization | feed: RSS
17:00
2026-06-10
pytorch.org
large-language-models

Portable vLLM Model Inference Kernels in Helion

Helion kernels were integrated into vLLM for FP8 inference using Qwen3 models and evaluated across NVIDIA H100 and B200 GPUs. The experiments demonstrated that Helion provides a productive PyTorch-nat…
