17:05
2026-05-01
gist.github.com
large-language-models
Qwen 3.6-35B-A3B FP8 (MoE, 3B active) on DGX Spark GB10
A developer deployed the Qwen 3.6-35B-A3B FP8 mixture-of-experts model (3 billion active parameters) on a DGX Spark GB10 system using vLLM, achieving inference with a 262,144-token context window and โฆ