
@CUDA

mentions: 36 · type: Organization · page 1/2
03:22
2026-06-06
dev.to
large-language-models

Run Gemma-4 12B on WSL2 with llama.cpp

A developer has published a guide for running Google's Gemma-4 12B instruction-tuned model on Windows Subsystem for Linux 2 (WSL2) using the llama.cpp framework. The process involves installing build …

21:31
2026-06-05
vettedconsumer.com
artificial-intelligence

NVIDIA Jetson Orin Nano Super: The $249 Way Into Edge AI

NVIDIA released the Jetson Orin Nano Super Developer Kit for $249, offering up to 67 TOPS of AI performance for edge computing and robotics. The tiny dev board runs the full NVIDIA AI software stack a…

19:30
2026-06-05
gilesthomas.com
machine-learning

JAX backends and devices

JAX defaults to placing data directly in GPU memory when a CUDA-enabled version is installed, which causes out-of-memory errors for large datasets that would fit in system RAM. The framework's `jax.devic…

22:55
2026-06-02
github.com
ai-infrastructure

Use your Nvidia GPU's VRAM as swap space on Linux

A developer released a tool that allows Linux users to repurpose Nvidia GPU VRAM as swap space, effectively tripling addressable memory on systems with soldered RAM. The nbd-vram daemon uses CUDA's me…

18:43
2026-06-01
docs.pytorch.org
machine-learning

When does fragmentation occur in the CUDA caching allocator?

The CUDA caching allocator in PyTorch fragments memory when allocated blocks prevent adjacent free blocks from merging, causing allocation failures despite sufficient total free memory. This fragmenta…

15:39
2026-06-01
leimao.github.io
machine-learning

PyTorch Custom Operation

PyTorch users can now implement custom operations in C++ and CUDA for use in both Python and C++ inference programs, with automatic device dispatch between CPU and CUDA implementations. The approach s…

14:18
2026-06-01
blogs.windows.com
artificial-intelligence

Introducing Surface Laptop Ultra: Made for world makers

Microsoft announced the Surface Laptop Ultra, a device built in partnership with NVIDIA and designed for creators, developers, and AI builders. The laptop features an NVIDIA Blackwell RTX GPU, up to 1…

09:57
2026-06-01
phoronix.com
ai-infrastructure

NBD-VRAM Provides Swap Space On Your NVIDIA GeForce GPUs

An open-source developer released NBD-VRAM, a tool that creates swap space on consumer NVIDIA GeForce GPUs under Linux. The software targets laptop users with soldered memory who need additional syste…

18:13
2026-05-29
gist.github.com
large-language-models

Benchmark Qwen3.6 27B on Modal

A developer benchmarked the Qwen3.6 27B model on Modal using llama.cpp, deploying a serverless pipeline that downloads GGUF shards from Hugging Face and runs perplexity evaluation on an A100-80GB GPU.…

12:51
2026-05-26
klongpy.org
machine-learning

KlongPy: PyTorch Back End and Autograd

KlongPy now supports a PyTorch backend that enables GPU acceleration and automatic differentiation for gradient-based computations. The torch backend outperforms NumPy by up to 8x on large arrays and …

16:17
2026-05-25
blog.mempko.com
artificial-intelligence

The Open/Closed Problem in AI

At the ninth MLSys conference in Seattle, researchers and industry leaders focused overwhelmingly on improving the efficiency of training and deploying large language models, with specialized hardware…
