# AMD Strix Halo RDMA Cluster Setup Guide

This guide details how to configure a two-node AMD Strix Halo cluster linked via Intel E810 RoCE v2 for distributed vLLM inference using Tensor Parallelism. It covers hardware prerequisites, host configuration on Fedora 43, and running the cluster with Ray and RCCL. The setup enables low-latency communication between the nodes, so they behave like a single machine for large model inference.

1. [TL;DR Quick Start](#1-tldr-quick-start)
2. [Concepts & Architecture](#2-concepts--architecture)
3. [Hardware Prerequisites](#3-hardware-prerequisites)
4. [Host Configuration (Fedora)](#4-host-configuration-fedora)
5. [Toolbox Installation & Network Verification](#5-toolbox-installation--network-verification)
6. [Running the Cluster](#6-running-the-cluster)
7. [Troubleshooting](#7-troubleshooting)
8. [References & Acknowledgements](#8-references--acknowledgements)

On Both Nodes:

1. Preparation: Install/Update Fedora 43 and the E810 NICs
   - Check firmware: ethtool -i