Qwen3.6-35B-A3B-NVFP4

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

10:37

2026-06-18

dev.to

large-language-models

Qwen3.6-35B NVFP4 runs on one H100 — A100 owners are out

NVIDIA released Qwen3.6-35B-A3B-NVFP4, a post-training FP4-quantized variant of Alibaba's 35B MoE model that fits on a single H100 by reducing VRAM from ~71 GB to ~23 GB. The quantization targets weig…

// co-occurs with top 7 entities

NVIDIA 1 Alibaba 1 H100 1 A100 1 Hopper 1 Blackwell 1 Model Optimizer 1

// topics top 5 topics

large language models 1 ai infrastructure 1 ai research 1 ai products 1 developer tools 1