06:07
2026-06-16
anyscale.com
artificial-intelligence
67% Cost Savings with PD Disaggregation Using Ray and vLLM on AMD MI325X
Engineers achieved up to 67% cost savings and 2.7x better goodput by using Prefill-Decode disaggregation with Ray and vLLM on AMD MI325X GPUs, separating prefill and decode phases onto dedicated hardw…