A10G

mentions: 2 | type: Organization

// recent coverage 2 mentions

2026-06-19 15:00 | hiraditya.github.io | large-language-models

Building vLLM from Source: A Field Guide (with all the pitfalls)

A developer building vLLM from source on an AWS g5 instance with Ubuntu 26.04 and Python 3.14 encountered multiple version-skew, driver, and toolchain issues, including a pitfall where missing nvidia-…

2026-06-02 17:26 | kyrieblunders.bearblog.dev | machine-learning

I made a kernel 2.2x faster. It made my training loop 3x slower

A developer wrote a fused decode-attention kernel that ran 2.2× faster than the baseline in microbenchmarks, but when integrated into a HuggingFace `generate` call for an RL training loop, the decode …

// co-occurs with top 8 entities

HuggingFace (1), Qwen2.5-0.5B-Instruct (1), Dr. GRPO (1), GSM8K (1), CuteDSL (1), SDPA (1), vLLM (1), NVIDIA (1)

// topics top 6 topics

large language models (2), ai infrastructure (2), machine learning (1), artificial intelligence (1), ai research (1), developer tools (1)