cd/entity/GPU· home entities GPU
grep -l @gpu /news/*.json | wc -l → 87

GPU

mentions 87 type Organization page 4/5 feed RSS

// recent coverage 87 mentions

08:53
2026-06-05
letsdatascience.com
artificial-intelligence

AI-Powers Worm Exploits Stolen Compute to Infect Mixed Devices

Researchers published a proof-of-concept AI-driven worm that embeds an open-weight LLM on compromised GPUs to autonomously scan, exploit, and propagate across Linux, Windows, and IoT devices. The worm…

00:31
2026-06-04
dev.to
ai-agents

Your Agent Has a Memory That Runs While You Sleep

A developer built a continuous AI agent memory system called `akm improve` that runs autonomously on local hardware, processing 14,189 memories across 48 scheduled runs in 24 hours with zero failures.…

00:00
2026-05-31
cefboud.com
large-language-models

Exploring Speculative Decoding: From Concept to Implementation

Speculative decoding optimizes LLM inference by using a cheap draft model to predict multiple tokens, which are then verified in a single forward pass of the target model, reducing memory-bandwidth bo…

23:18
2026-05-30
categoryvc.com
ai-chips

AI Hardware

Modern GPUs spend most of their time during AI inference waiting for data, as memory bandwidth cannot keep pace with compute throughput. This fundamental bottleneck has driven the AI hardware market, …

03:52
2026-05-29
dev.to
neural-networks

Tensors Explained Part 2: Why Tensors Are Useful

Tensors enable hardware acceleration by leveraging GPUs and TPUs to perform parallel mathematical operations efficiently, making them essential for training neural networks. They also support automati…

19:50
2026-05-28
arxiv.org
artificial-intelligence

SIA: Self Improving AI with Harness and Weight Updates

Researchers have developed SIA, a self-improving AI system that updates both its own software scaffolding and internal model weights without human intervention, combining two previously separate appro…

18:39
2026-05-28
letsdatascience.com
machine-learning

MoE Transforms Open Model Ecosystem Costs

Mixture of Experts (MoE) models are reshaping the economics of open-model deployments by reducing GPU inference costs and altering serving stack requirements. The shift toward MoE architectures in 202…

11:53
2026-05-28
github.com
large-language-models

Why LLM decode is memory-bound, not compute-bound

LLM inference costs 100x more than traditional machine learning inference because autoregressive generation requires a separate forward pass through the entire model for each output token. A Llama 3.1…

12:51
2026-05-26
klongpy.org
machine-learning

KlongPy: PyTorch Back End and Autograd

KlongPy now supports a PyTorch backend that enables GPU acceleration and automatic differentiation for gradient-based computations. The torch backend outperforms NumPy by up to 8x on large arrays and …

16:35
2026-05-24
thedeepview.com
artificial-intelligence

How the compute crisis is defining the next stage of AI

Lambda Chief Commercial Officer Robert Brooks IV argued that computing power is becoming one of the most strategically important resources in the AI economy, with his company building supercomputers f…

← prev page 4 / 5 next →
// co-occurs with top 8 entities