Ask HN: How is GPU power draw measured at scale?

A developer on Hacker News is asking how GPU power draw is measured at scale in self-hosted setups, sharing per-GPU data from NVML polling and questioning whether per-request attribution survives at rack scale or if monitoring shifts entirely to PDU/BMC level.

Published Jun 26, 2026

How do people measure GPU power usage in large (32x) self-hosted setups or small multi-rack setups? I've seen some PDUs that collect and transmit data, but I'm unsure what the process looks like and whether, or how, people do this on small builds.

Currently, I poll NVML's nvmlDeviceGetPowerUsage every 100 ms during inference and record the peak and mean power per request, which gives data like this:

| model | mean-power range (W) | spread (W) | stdev (W) |
|---|---|---|---|
| qwen3-8b | 114.3-121.9 | 7.6 | 1.17 |
| llama-3.1-8b-instruct | 104.7-122.1 | 17.4 | 4.29 |
| qwen2.5-1.5b-instruct | 53.7-73.0 | 19.3 | 5.23 |
| mistral-7b-instruct-v0.3 | 96.2-120.0 | 23.8 | 6.01 |
| qwen2.5-7b-instruct | 88.7-124.5 | 35.8 | 7.73 |
| gemma-3-1b-it | 49.4-56.7 | 7.3 | 2.13 |
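For readers checking the columns: the spread is the highest per-request mean minus the lowest. Below is a minimal sketch of how the range, spread, and stdev could be derived from a list of per-request means. The function name `summarize` is hypothetical, and using the sample standard deviation is an assumption, since the post does not say which estimator was used.

```python
import statistics


def summarize(per_request_mean_w):
    """Summarize per-request mean power values (watts) for one model."""
    lo, hi = min(per_request_mean_w), max(per_request_mean_w)
    return {
        "min_w": lo,
        "max_w": hi,
        "spread_w": round(hi - lo, 1),  # max minus min, as in the table
        # Assumption: sample standard deviation (n - 1 denominator).
        "stdev_w": statistics.stdev(per_request_mean_w),
    }


# Only the range endpoints are published in the post, so only the spread
# column can be re-derived here; the stdev needs the full per-request list.
qwen3_8b = summarize([114.3, 121.9])
qwen25_7b = summarize([88.7, 124.5])
```

With just the two endpoints, `spread_w` matches the table's spread for each row, while `stdev_w` does not reflect the posted stdev.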

The table above holds per-GPU, single-card data. I don't know whether anything like per-request attribution survives at rack scale, or whether monitoring there happens entirely at the PDU/BMC level instead.
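For context, the per-request polling described above could look roughly like the following sketch. It is not the poster's actual code: `PowerSampler` and `make_nvml_reader` are hypothetical names, and it assumes the `pynvml` bindings (the nvidia-ml-py package) are installed. `nvmlDeviceGetPowerUsage` reports milliwatts, so the reader converts to watts.

```python
import statistics
import threading
import time


def make_nvml_reader(gpu_index=0):
    """Return a zero-argument function that reads GPU power in watts via NVML."""
    import pynvml  # assumption: nvidia-ml-py is installed and a GPU is present

    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(gpu_index)
    # nvmlDeviceGetPowerUsage returns milliwatts; convert to watts.
    return lambda: pynvml.nvmlDeviceGetPowerUsage(handle) / 1000.0


class PowerSampler:
    """Polls a power reader in a background thread while one request runs."""

    def __init__(self, read_watts, interval_s=0.1):
        self.read_watts = read_watts
        self.interval_s = interval_s  # 0.1 s matches the 100 ms polling in the post
        self.samples = []
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def _run(self):
        # Always take at least one sample, then poll until asked to stop.
        while True:
            self.samples.append(self.read_watts())
            if self._stop.wait(self.interval_s):
                break

    def __enter__(self):
        self._thread.start()
        return self

    def __exit__(self, *exc):
        self._stop.set()
        self._thread.join()

    def summary(self):
        """Mean and peak power (watts) over the samples taken for this request."""
        return {
            "mean_w": statistics.fmean(self.samples),
            "peak_w": max(self.samples),
            "n": len(self.samples),
        }


# Usage on a machine with an NVIDIA GPU (run_inference is a placeholder):
#     with PowerSampler(make_nvml_reader(0)) as sampler:
#         run_inference(prompt)
#     print(sampler.summary())
```

Passing the reader in as a callable keeps the sampler independent of NVML, so the same loop could in principle wrap another power source, such as a BMC or PDU query, if one is available.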

Comments URL: [https://news.ycombinator.com/item?id=48684938](https://news.ycombinator.com/item?id=48684938)

Points: 3
