GPU

mentions 87 type Organization page 5/5 feed RSS

// recent coverage 87 mentions

15:38

2026-05-22

dwarkesh.com

ai-chips

Reiner Pope – Chip design from the bottom up

Reiner Pope, CEO of AI chip startup MatX and former Google engineer, delivered a blackboard lecture explaining chip design from basic logic gates to the architectures of GPUs, TPUs, FPGAs, and the hum…

04:54

2026-05-22

arxiv.org

machine-learning

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

CODA, a GPU kernel abstraction that reparameterizes memory-bound Transformer operations like normalization and activations to execute as GEMM-plus-epilogue programs, keeping data on-chip to reduce glo…

11:37

2026-05-21

dev.to

large-language-models

End-to-End Observability for vLLM and TGI: from DCGM to Tokens

Running large language model inference servers like vLLM and TGI in production requires specialized observability because they behave differently from standard web services, with key metrics like late…

03:25

2026-05-20

dev.to

artificial-intelligence

GPUs, Data Security, and the AI Performance Race: Running Powerful Models Without Losing Control of Your Data

Practical guide for engineers and security teams evaluating whether to run large AI models locally, in private cloud, or via secure enterprise platforms. It argues that AI performance depends not only…

18:18

2026-05-13

newsletter.semianalysis.com

ai-chips

Cerebras — Faster Tokens Please

Cerebras Systems has secured a 750MW compute deal with OpenAI, positioning the company for its upcoming IPO as demand for fast token generation surges. The wafer-scale chip maker's speed advantages, p…

18:56

2026-04-30

pytorch.org

large-language-models

SMG: The Case for Disaggregating CPU from GPU in LLM Serving

Shepherd Model Gateway (SMG) has disaggregated all CPU-bound workloads from GPU inference in large language model serving, moving tokenization, detokenization, and parsing into a dedicated Rust gatewa…

15:19

2023-02-25

alexselimov.com

open-source

Hosting your own git frontend service using Gitea

The article provides a step-by-step guide on self-hosting a Git frontend service using Gitea on a Debian server with Nginx. It covers setting up a PostgreSQL database for Gitea, downloading and instal…

← prev page 5 / 5

// co-occurs with top 8 entities

PyTorch 11 LLM 9 CPU 8 CUDA 7 vLLM 6 NVIDIA 6 HBM 6 TPU 5