cd/entity/CUDA· home entities CUDA
grep -l @cuda /news/*.json | wc -l → 43

CUDA

mentions 43 type Organization page 1/3 feed RSS
09:55
2026-06-13
imil.net
large-language-models

RTX 5080 and RTX 3090 Setup: 80 Tok/s on Qwen 3.6 27B Q8

A user combined an RTX 5080 and RTX 3090 on an Asus Prime X570-Pro motherboard to run Qwen 3.6 27B Q8 at over 80 tokens per second. The setup required disabling CSM, enabling Above 4G Decoding and ReS…

21:40
2026-06-11
docs.rapids.ai
ai-infrastructure

Polars GPU Engine

NVIDIA's cuDF library now provides GPU-accelerated execution engines for the Polars Lazy API, enabling users to run dataframe operations on GPUs with automatic fallback to CPU when operations are unsu…

16:25
2026-06-10
phoronix.com
artificial-intelligence

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

AMD released Lemonade SDK version 10.7, adding NVIDIA CUDA support to its local AI server solution that previously only supported AMD hardware, Apple Metal GPUs, and AArch64 CPUs. The update integrate…

15:41
2026-06-09
modular.com
ai-infrastructure

What about OpenCL and CUDA C++ alternatives?

Chris Lattner, a lead engineer on Apple's original OpenCL implementation, explains why OpenCL and other C++-based GPU programming models failed to become dominant in AI, citing slow committee-driven d…

03:22
2026-06-06
dev.to
large-language-models

Run Gemma-4 12B on WSL2 with llama.cpp

A developer has published a guide for running Google's Gemma-4 12B instruction-tuned model on Windows Subsystem for Linux 2 (WSL2) using the llama.cpp framework. The process involves installing build …

21:31
2026-06-05
vettedconsumer.com
artificial-intelligence

NVIDIA Jetson Orin Nano Super: The $249 Way Into Edge AI

NVIDIA released the Jetson Orin Nano Super Developer Kit for $249, offering up to 67 TOPS of AI performance for edge computing and robotics. The tiny dev board runs the full NVIDIA AI software stack a…

19:30
2026-06-05
gilesthomas.com
machine-learning

JAX backends and devices

JAX defaults to loading data directly onto GPU memory when a CUDA-enabled version is installed, causing out-of-memory errors for large datasets that would fit in system RAM. The framework's `jax.devic…

22:55
2026-06-02
github.com
ai-infrastructure

Use your Nvidia GPU's VRAM as swap space on Linux

A developer released a tool that allows Linux users to repurpose Nvidia GPU VRAM as swap space, effectively tripling addressable memory on systems with soldered RAM. The nbd-vram daemon uses CUDA's me…

18:43
2026-06-01
docs.pytorch.org
machine-learning

When does fragmentation occur in the CUDA caching allocator?

The CUDA caching allocator in PyTorch fragments memory when allocated blocks prevent adjacent free blocks from merging, causing allocation failures despite sufficient total free memory. This fragmenta…

15:39
2026-06-01
leimao.github.io
machine-learning

PyTorch Custom Operation

PyTorch users can now implement custom operations in C++ and CUDA for use in both Python and C++ inference programs, with automatic device dispatch between CPU and CUDA implementations. The approach s…

14:18
2026-06-01
blogs.windows.com
artificial-intelligence

Introducing Surface Laptop Ultra: Made for world makers

Microsoft announced the Surface Laptop Ultra, a device built in partnership with NVIDIA and designed for creators, developers, and AI builders. The laptop features an NVIDIA Blackwell RTX GPU, up to 1…

09:57
2026-06-01
phoronix.com
ai-infrastructure

NBD-VRAM Provides Swap Space On Your NVIDIA GeForce GPUs

An open-source developer released NBD-VRAM, a tool that creates swap space on consumer NVIDIA GeForce GPUs under Linux. The software targets laptop users with soldered memory who need additional syste…

18:13
2026-05-29
gist.github.com
large-language-models

Benchmark Qwen3.6 27B on Modal

A developer benchmarked the Qwen3.6 27B model on Modal using llama.cpp, deploying a serverless pipeline that downloads GGUF shards from Hugging Face and runs perplexity evaluation on an A100-80GB GPU.…

page 1 / 3 next →
// co-occurs with top 8 entities