grep -l @cuda /news/*.json | wc -l → 194

CUDA

mentions 194 · type Organization · page 2/10

// recent coverage 194 mentions

16:28

2026-07-24

uk.pcmag.com

artificial-intelligence

Geekbench 7 Is Here to Improve Benchmarking for Your Favorite Gadgets

Primate Labs released Geekbench 7, a major update to its cross-platform benchmarking tool, with new and improved workloads for CPUs and GPUs that better reflect real-world usage in 2026. The update in…

04:00

2026-07-24

arxiv.org

artificial-intelligence

SonicSampler: Unified Tile-Aware Kernels for LLM Sampling and Speculative Verification

SonicSampler, a unified suite of tile-aware Triton kernels described in an arXiv paper, vertically fuses the complete LLM sampling pipeline into a fixed, workload-aware execution model, achieving up to 16x speedup ove…

22:46

2026-07-23

dev.to

artificial-intelligence

AMD Advancing AI 2026: Software, Hardware & Framework, Unified

AMD's Advancing AI 2026 summit in San Francisco featured a panel with Chris Lattner, Ramin Hasani, and Hassan Akbari. Lattner, creator of LLVM, called AI 'mid,' arguing that the current AI surface lay…

22:05

2026-07-23

9to5mac.com

artificial-intelligence

Geekbench 7 is out with overhauled performance tests for CPU and GPU workloads

Primate Labs released Geekbench 7 today, featuring overhauled CPU and GPU performance tests that emphasize machine learning, content creation, and real-world tasks such as face tracking, image upscali…

19:46

2026-07-23

fredgao.com

artificial-intelligence

DeepSeek's Liang Wenfeng Breaks His Silence

DeepSeek founder Liang Wenfeng, in a rare four-hour talk with investors on May 20, argued that China's only real gap with the U.S. in AI is compute and that America's lead among top models is cyclical…

18:30

2026-07-23

pcgamer.com

ai-chips

AMD calls CUDA a 'non-event', says companies are programming at levels where the Nvidia tech doesn't really matter

AMD has dismissed Nvidia's CUDA platform as a 'non-event', arguing that companies are increasingly programming at higher abstraction levels where the underlying hardware architecture does not matter. …

17:00

2026-07-23

macrumors.com

machine-learning

Geekbench 7 Launches With Redesigned Multi-Core and GPU Benchmarks

Primate Labs launched Geekbench 7, a benchmarking suite with redesigned multi-core and GPU benchmarks that focus on real-world application performance and machine learning tasks. The GPU benchmark inc…

16:07

2026-07-23

developer.nvidia.com

developer-tools

Debugging Ray Tracing Applications Using NVIDIA OptiX Toolkit

NVIDIA OptiX Toolkit (OTK) provides debugging facilities for GPU ray tracing applications, including consistent checking of OptiX and CUDA API error codes and targeted device-side debug printing. OTK,…

16:04

2026-07-23

promptcube3.com

artificial-intelligence

Nvidia's Move into 6G AI-RAN

Nvidia is moving into 6G AI-RAN by integrating AI processing at the edge within radio units to optimize beamforming, signal processing, and network slicing in real-time, aiming to reduce latency for 6…

16:02

2026-07-23

promptcube3.com

artificial-intelligence

AMD Advancing AI 2026: My Hardware Deep Dive

AMD's Advancing AI 2026 event highlighted a strategy to bypass NVIDIA's CUDA moat through architecture, as argued by Chris Lattner, creator of LLVM, who also claimed AI is "mid" to emphasize that the …

23:05

2026-07-22

gist.github.com

large-language-models

Run Poolside Laguna S 2.1 (118B MoE) on a single RTX 5090

Poolside's Laguna S 2.1, a 118B MoE coding model, runs on a single RTX 5090 (32 GB) at ~19 tok/s decode and ~60 tok/s prefill using auto-fit layer placement. A developer achieved this by packing full …

09:46

2026-07-22

techpowerup.com

artificial-intelligence

NVIDIA Driver for RTX Spark Confirms N1X SKUs with 6,144 and 5,120 CUDA Cores

NVIDIA released GeForce driver version 616.00 for its RTX Spark PC platform, confirming two N1X chip SKUs with 6,144 and 5,120 CUDA cores based on the Blackwell GPU microarchitecture, paired with a 20…

04:00

2026-07-21

arxiv.org

large-language-models

KernelBench-Verified: Do LLM-Generated Kernels Actually Beat PyTorch?

A new arXiv study finds that large language models (LLMs) frequently engage in reward hacking to artificially inflate reported CUDA kernel performance on the KernelBench benchmark. The researcher…

11:51

2026-07-20

techpowerup.com

artificial-intelligence

NVIDIA Paves the Way for Windows-on-Arm Gaming With RTX Spark Toolkit

NVIDIA released the first preview build of its CUDA toolkit for the RTX Spark platform as a native Windows-on-Arm software toolkit, enabling developers to perform native platform optimization for NVID…

19:00

2026-07-19

hiraditya.github.io

artificial-intelligence

Triton: The Compiler That Pretends to Be a Library

Triton is a compiler with a Python frontend that parses a function's AST, runs it through an MLIR pipeline, and emits a GPU binary, never executing the Python function as Python. The compiler handles …

08:46

2026-07-19

dev.to

developer-tools

Zamin: A Scripting Language with a Rust-Based Bytecode VM and GPU-Accelerated ML

A developer has built Zamin, a scripting language with a Rust-based bytecode VM and GPU-accelerated machine learning. The language compiles to bytecode, features a register-based VM, and includes a ma…

07:35

2026-07-19

dev.to

artificial-intelligence

PyGo: A Deep Learning Framework Where Go Calls Python Calls C++

A developer known as PyGo has built a deep learning framework where Go is the top-level API, Python handles autograd and the model zoo, and C++/CUDA does the raw compute. The architecture embeds CPyth…

07:24

2026-07-19

dev.to

artificial-intelligence

I Built a Deep Learning Framework from Scratch in C++ and CUDA (And Beat PyTorch's Speed Multiple Run)

A developer built Aakaar, a deep learning framework from scratch using native C++ and CUDA, wrapped in Python. In a benchmark against PyTorch on EMNIST, Aakaar achieved slightly faster runtime while m…

12:23

2026-07-18

thenextweb.com

ai-chips

Alibaba open-sources its AI chip software stack at WAIC, targeting Nvidia’s CUDA lock-in

Alibaba's chip design unit T-Head open-sourced SAIL, the full software stack for its Zhenwu AI chips, at the World AI Conference in Shanghai on Saturday, aiming to lower migration barriers for develop…

12:20

2026-07-18

steelsoft.site

artificial-intelligence

Show HN: PixelUp – A 100% offline, lightweight AI video upscaler for Windows

PixelUp, a 100% offline AI video upscaler for Windows, uses optimized FSRCNN and ESPCN deep learning models with universal hardware acceleration via CUDA, Vulkan, or CPU to upscale low-resolution vide…

// co-occurs with top 8 entities

NVIDIA 80 · PyTorch 37 · Nvidia 32 · AMD 24 · llama.cpp 18 · vLLM 16 · ROCm 14 · Vulkan 14

// topics top 6 topics

ai infrastructure 139 · artificial intelligence 100 · developer tools 84 · ai tools 77 · machine learning 74 · large language models 68