16:43
2026-06-25
developer.nvidia.com
artificial-intelligence
Scaling AI Inference Across Multiple GPUs Using NVIDIA TensorRT with Multi-Device Inference Support
NVIDIA announced multi-device inference support in TensorRT 11.0, enabling native high-performance multi-GPU inference for generative AI workloads. The feature integrates with NCCL for distributed colโฆ