From Kernel Scheduler to Python Source Line: Tracing a GPU Stall End to End
An eBPF agent that attaches to the CUDA runtime, CUDA driver, and Linux kernel scheduler simultaneously can trace a GPU stall back to the exact Python source line that triggered it. The tool correlates events across all …
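To make the idea of cross-layer correlation concrete, here is a minimal sketch in plain Python. It is not the agent's actual implementation: the `Event` record, the layer labels, the `explain_stall` helper, and the sample timestamps are all hypothetical, and it assumes each probe reports a thread id plus monotonic entry and exit timestamps. Given those assumptions, a slow CUDA runtime call can be matched to the innermost Python frame that encloses it, the driver calls it overlaps, and the time the calling thread spent off-CPU.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Event:
    """Hypothetical record emitted by one probe (not a real agent schema)."""
    layer: str     # "python", "cuda_runtime", "cuda_driver", or "sched"
    tid: int       # thread id the probe fired on
    start_ns: int  # monotonic timestamp at entry
    end_ns: int    # monotonic timestamp at exit
    name: str      # function name, or "file.py:LINE" for the python layer


def overlaps(a: Event, b: Event) -> bool:
    """True if the two time intervals intersect."""
    return a.start_ns < b.end_ns and b.start_ns < a.end_ns


def explain_stall(events: List[Event], threshold_ns: int) -> List[Dict]:
    """Attribute each slow CUDA runtime call to a Python line on the same thread."""
    reports = []
    for call in events:
        duration = call.end_ns - call.start_ns
        if call.layer != "cuda_runtime" or duration < threshold_ns:
            continue
        same_thread = [e for e in events if e.tid == call.tid and e is not call]

        # Innermost Python frame whose interval fully encloses the slow call.
        frames = [e for e in same_thread
                  if e.layer == "python"
                  and e.start_ns <= call.start_ns and e.end_ns >= call.end_ns]
        frame = max(frames, key=lambda e: e.start_ns, default=None)

        # Time the thread was descheduled while the call was in flight.
        off_cpu_ns = sum(min(e.end_ns, call.end_ns) - max(e.start_ns, call.start_ns)
                         for e in same_thread
                         if e.layer == "sched" and overlaps(e, call))

        driver_calls = [e.name for e in same_thread
                        if e.layer == "cuda_driver" and overlaps(e, call)]

        reports.append({
            "runtime_call": call.name,
            "duration_ns": duration,
            "python_line": frame.name if frame else None,
            "driver_calls": driver_calls,
            "off_cpu_ns": off_cpu_ns,
        })
    return reports


# Made-up sample data for illustration only.
EVENTS = [
    Event("python", 101, 0, 10_000_000, "train.py:42"),
    Event("python", 101, 1_000_000, 9_000_000, "model.py:17"),
    Event("cuda_runtime", 101, 2_000_000, 8_000_000, "cudaStreamSynchronize"),
    Event("cuda_driver", 101, 2_100_000, 7_900_000, "cuStreamSynchronize"),
    Event("sched", 101, 3_000_000, 7_000_000, "off_cpu"),
    Event("cuda_runtime", 101, 8_500_000, 8_600_000, "cudaLaunchKernel"),
    Event("sched", 202, 0, 5_000_000, "off_cpu"),
]

reports = explain_stall(EVENTS, threshold_ns=5_000_000)
for report in reports:
    print(report)
```

Joining on thread id and time interval keeps the sketch simple. A real agent would more plausibly recover the Python line by unwinding the interpreter stack at the moment the probe fires, which this example does not attempt.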