2026-05-11
loopholelabs.io
ai-infrastructure
Ollama Doesn't Know Its GPU Is on Another Machine
Ollama, an AI model server, ran on a MacBook with no NVIDIA GPU by using GTAP software to intercept CUDA calls and forward them to a remote DGX Spark workstation with a 128 GB Blackwell GPU. The setup…
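The general idea of intercepting GPU calls and forwarding them to another machine can be illustrated with a toy sketch. This is not GTAP's protocol or API, which the summary does not describe. Every name here (`RemoteGpuServer`, `ForwardingStub`, `vector_add`, the JSON message shape) is hypothetical, no real CUDA is involved, and the "network" is a direct function call standing in for a socket.

```python
import json


class RemoteGpuServer:
    """Hypothetical stand-in for the machine that owns the GPU (no real CUDA)."""

    # Only these operations may be invoked by a forwarded request.
    ALLOWED = {"malloc", "memcpy_h2d", "vector_add", "memcpy_d2h"}

    def __init__(self):
        self._buffers = {}      # handle -> list of floats, playing the role of device memory
        self._next_handle = 1

    def malloc(self, n):
        handle = self._next_handle
        self._next_handle += 1
        self._buffers[handle] = [0.0] * n
        return handle

    def memcpy_h2d(self, handle, data):
        # Copy host data into the "device" buffer.
        self._buffers[handle][:len(data)] = data

    def vector_add(self, a, b, out):
        # Stand-in for a kernel launch: elementwise sum of two buffers.
        self._buffers[out] = [x + y for x, y in zip(self._buffers[a], self._buffers[b])]

    def memcpy_d2h(self, handle):
        return list(self._buffers[handle])

    def handle_request(self, raw):
        # Decode one forwarded call, run it locally, and encode the result.
        req = json.loads(raw)
        if req["method"] not in self.ALLOWED:
            raise ValueError("unknown method: " + req["method"])
        result = getattr(self, req["method"])(*req["args"])
        return json.dumps({"result": result}).encode()


class ForwardingStub:
    """Client side: looks like a local GPU API, but every call is serialized and sent away."""

    def __init__(self, transport):
        self._transport = transport   # callable taking request bytes, returning reply bytes
        self.calls_forwarded = 0

    def __getattr__(self, name):
        # Invoked only for attributes the stub lacks, i.e. every GPU operation.
        def forward(*args):
            self.calls_forwarded += 1
            raw = json.dumps({"method": name, "args": list(args)}).encode()
            return json.loads(self._transport(raw))["result"]
        return forward


def run_demo():
    server = RemoteGpuServer()
    gpu = ForwardingStub(server.handle_request)  # in a real setup this would be a network hop
    a, b, out = gpu.malloc(3), gpu.malloc(3), gpu.malloc(3)
    gpu.memcpy_h2d(a, [1.0, 2.0, 3.0])
    gpu.memcpy_h2d(b, [10.0, 20.0, 30.0])
    gpu.vector_add(a, b, out)
    return gpu.memcpy_d2h(out), gpu.calls_forwarded


if __name__ == "__main__":
    print(run_demo())
```

The calling code only ever talks to the stub, so it has no way to tell whether the work happened locally or elsewhere. The same property is what lets an unmodified application run against a GPU it does not physically have.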