PyTorch

mentions: 154 · type: Organization · page 7 of 8

// recent coverage 154 mentions

02:11

2026-05-28

runwayml.com

machine-learning

DTensor, Correctness and the Costs of Abstraction

DTensor, PyTorch's distributed tensor abstraction, attaches placement metadata to every tensor to automatically propagate layouts and insert correct collective operations during distributed training. …

00:00

2026-05-28

mindstudio.ai

artificial-intelligence

What Is ROCm? AMD's Open Compute Platform for AI and Deep Learning

AMD's ROCm (Radeon Open Compute platform) has reached production-ready maturity as an open-source alternative to NVIDIA's CUDA, now supporting LLM inference, fine-tuning, and image generation on AMD G…

00:00

2026-05-28

mindstudio.ai

artificial-intelligence

Running Local AI on AMD: ROCm, Ollama, and LM Studio Performance in 2026

AMD's ROCm platform now supports PyTorch, Ollama, LM Studio, and ComfyUI out of the box, enabling local AI workloads on AMD GPUs without the compatibility issues that plagued earlier versions. Users w…

19:09

2026-05-27

pytorch.org

machine-learning

Why Is PyTorch Compile So Fast: Kernel Fusion

PyTorch's Inductor compiler uses kernel fusion to accelerate model execution by up to 10x, grouping dependent operations into single Triton kernels to reduce memory traffic and kernel launch overhead.…

11:02

2026-05-27

dev.to

ai-infrastructure

TensorCircuit-NG: Quantum Software On AI, For AI, With AI

TensorCircuit-NG, a quantum software stack built on AI infrastructure, treats quantum circuits as specialized tensor operations to leverage existing AI tooling for automatic differentiation, compilati…

06:53

2026-05-27

benfrederickson.com

machine-learning

Python as a Declarative Programming Language (2017)

Python's performance in the benchmarks game is roughly 40 times slower than C or C++, yet it remains the dominant language for data analysis and machine learning because core libraries like NumPy, Ten…

05:37

2026-05-27

dev.to

machine-learning

The bf16 grad accumulator that killed our SDXL LoRA training

Photoroom's SDXL LoRA fine-tuning for a product photography model silently corrupted its adapter weights over six days due to a bf16 gradient accumulation issue. The custom training loop, forked from …

22:08

2026-05-26

developer.nvidia.com

ai-infrastructure

Extract More Kernel Performance with NVIDIA CompileIQ Auto-Tuning

NVIDIA released CompileIQ, an AI-powered compiler auto-tuning framework that uses evolutionary and genetic algorithms to optimize GPU compilers for individual workloads. The tool, included in NVIDIA C…

21:36

2026-05-26

dev.to

ai-tools

Only 14.6% of 'AI-native' job postings actually name an AI tool. I checked 37,920.

Four-Leaf's AI Stack Index analyzed 37,920 job postings from public companies and found that only 14.6% mention any of 75 tracked AI tools or skills. The most common AI skill, agentic AI, appears in j…

13:56

2026-05-26

github.com

ai-infrastructure

Wave – A universal GPU instruction set architecture

A new open-source project called WAVE has introduced a vendor-neutral GPU instruction set architecture that allows developers to write GPU code once and run identical binaries on NVIDIA, AMD, Apple, a…

12:51

2026-05-26

klongpy.org

machine-learning

KlongPy: PyTorch Back End and Autograd

KlongPy now supports a PyTorch backend that enables GPU acceleration and automatic differentiation for gradient-based computations. The torch backend outperforms NumPy by up to 8x on large arrays and …

03:31

2026-05-26

dev.to

machine-learning

I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail

A developer with 17 years of distributed systems and SRE experience built torchdiag, a diagnostic toolkit for PyTorch that provides five commands to measure gradient flow, detect dead neurons, and ver…

00:41

2026-05-26

dev.to

machine-learning

Predicting Blood Glucose Fluctuations: Building a Transformer-based CGM Forecaster with PyTorch & InfluxDB

A developer built a Transformer-based time-series forecaster using PyTorch to predict blood glucose levels 30 minutes in advance from Continuous Glucose Monitoring (CGM) data. The model, which uses se…

08:25

2026-05-25

tinyvolt.com

ai-tools

Show HN: Geomatic – a command-driven geometry studio enabled with autodiff

Geomatic, a command-driven geometry studio with automatic differentiation, allows users to create and manipulate points, lines, and scalars using simple command syntax. The tool supports NumPy-like br…

12:11

2026-05-23

thonking.ai

artificial-intelligence

Matrix Multiplications on GPUs Run Faster When Given "Predictable" Data

Matrix multiplications on NVIDIA A100 GPUs run up to 15% faster when the input matrices contain predictable values like zeros or integers, rather than random data. The performance difference stems fro…

11:50

2026-05-23

horace.io

machine-learning

Making Deep Learning Go Brrrr from First Principles (2022)

The article explains that optimizing deep learning performance should be approached by identifying whether a system is bottlenecked by compute, memory bandwidth, or overhead, rather than relying on ad…

11:50

2026-05-23

horace.io

machine-learning

Making Deep Learning Go Brrrr from First Principles

The article explains that optimizing deep learning performance should be approached by reasoning from first principles—identifying whether a system is bottlenecked by compute, memory bandwidth, or ove…

20:24

2026-05-22

pytorch.org

machine-learning

Join the PyTorch Foundation Ambassador Program: A Global Network of Community Leaders

The PyTorch Foundation reopened applications for its Ambassador Program, seeking community leaders to mentor users, create tutorials, and organize events for a two-year term. The foundation especially…

18:00

2026-05-22

gist.github.com

artificial-intelligence

Fix "tensorWritesFailed" for Flux.2 Klein 9B models in Draw Things — handles all CivitAI variants (ComfyUI FP8, BF16, F8_E5M2)

This article describes a Python script that fixes the "tensorWritesFailed" import error in Draw Things for Flux.2 Klein 9B models downloaded from CivitAI. The error occurs because Draw Things expects …

07:00

2026-05-22

leimao.github.io

machine-learning

PyTorch Triton Kernel Transparent Tracing and Compilation

PyTorch has introduced transparent tracing and compilation for Triton kernels, allowing custom operations to be visible to the compiler for optimization. The framework now supports compiling Triton ke…

// co-occurs with top 8 entities

NVIDIA 26 · CUDA 24 · JAX 15 · Hugging Face 13 · Python 12 · TensorFlow 12 · GPU 11 · NumPy 9