PyTorch

Mentions: 154 | Type: Organization | Page 5 of 8

// recent coverage 154 mentions

09:54

2026-06-13

dev.to

artificial-intelligence

Build Your Own Shakespearean LLM

A developer built a character-level language model from scratch using Shakespeare's complete works, training a nanoGPT model on a consumer-grade MacBook Pro in about 15 minutes. The project demonstrat…

13:47

2026-06-12

pytorch.org

artificial-intelligence

PyTorch Meetup Singapore: A milestone in APAC

Eighty engineers, researchers, and community builders gathered for the inaugural PyTorch Meetup Singapore, hosted at the Red Hat Asia Pacific office. The event featured technical talks on inference, d…

12:00

2026-06-12

kdnuggets.com

machine-learning

3 NumPy Tricks for Numerical Performance

NumPy's vectorization and broadcasting techniques can accelerate numerical operations by up to 56x compared to explicit Python loops, as demonstrated by a column standardization task on a 50,000-row, …

07:10

2026-06-12

marktechpost.com

computer-vision

A Coding Implementation on MONAI for End-to-End 3D Spleen Segmentation Using UNet on Medical CT Volumes

MONAI, an open-source medical imaging framework, has released a tutorial demonstrating an end-to-end 3D spleen segmentation pipeline using a UNet model on CT volumes from the Medical Segmentation Deca…

00:00

2026-06-12

anyscale.com

large-language-models

Inside FSDP with PyTorch and Ray: Scaling Model Training with Fully Sharded Data Parallel

Alibaba's 1.7B parameter Qwen3-TTS voice cloning model was fine-tuned using Fully Sharded Data Parallel (FSDP) with PyTorch and Ray, demonstrating memory-efficient distributed training across 4 GPUs. …

00:00

2026-06-11

huggingface.co

machine-learning

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

PyTorch's `nn.Linear` module transposes its weight tensor before performing matrix multiplication and addition, as revealed by profiler traces showing an `aten::t` operation that only modifies tensor …

14:08

2026-06-10

github.com

developer-tools

TorchCodec 0.14: HDR Video Decoding for CPU and CUDA, and Fast Wav Decoder

Meta's PyTorch team released TorchCodec 0.14, adding a fast WavDecoder that bypasses FFmpeg for direct WAV file reading and HDR video decoding support for CPU and CUDA with full float32 precision. The…

00:00

2026-06-10

pyrefly.org

machine-learning

Talk: Tensor Shapes in the Type System

Pyrefly, a Python type checker, has introduced an experimental feature that brings tensor shapes into Python's type system, allowing shape annotations to become inferred type hints instead of comments…

15:41

2026-06-09

modular.com

ai-infrastructure

What about OpenCL and CUDA C++ alternatives?

Chris Lattner, a lead engineer on Apple's original OpenCL implementation, explains why OpenCL and other C++-based GPU programming models failed to become dominant in AI, citing slow committee-driven d…

08:37

2026-06-09

marktechpost.com

ai-tools

NVIDIA cuTile Python Tutorial: Building Tiled GPU Kernels for Vector Addition, Matrix Addition, and Matrix Multiplication in Colab

NVIDIA released a tutorial demonstrating how to build tiled GPU kernels for vector addition, matrix addition, and matrix multiplication using cuTile Python in Google Colab. The workflow includes envir…

23:58

2026-06-08

simonwillison.net

artificial-intelligence

Siri AI at WWDC 2026

Apple at WWDC 2026 announced a new generation of Siri AI features, including a custom Gemini-derived model running on its Private Cloud Compute infrastructure and vision LLMs to extract information fr…

12:00

2026-06-08

kdnuggets.com

artificial-intelligence

5 Must-Know Python Concepts for AI Engineers

AI engineers must master five critical Python concepts to build scalable, secure, and robust production systems, including PyTorch's autograd for automatic gradient computation and the `__call__` meth…

08:48

2026-06-06

dev.to

machine-learning

Carbon-Aware Model Training: Scheduling GPU Workloads Around Electricity Carbon Intensity

A developer has built a carbon-aware training pipeline for PyTorch that schedules GPU workloads around real-time electricity carbon intensity, reducing CO2 emissions by delaying training until low-car…

05:01

2026-06-06

dev.to

machine-learning

Why JAX Is a Much Better Backend for Quantum Circuit Simulation Than PyTorch

A developer benchmarked quantum circuit simulation backends using a 20-qubit VQE workload on an NVIDIA RTX 5090 GPU, finding JAX/XLA 12.4x faster than PyTorch and 15.7x faster than TorchQuantum for po…

22:32

2026-06-05

marktechpost.com

machine-learning

A Hands-On Coding Tutorial on Qualcomm AI Hub Models for Classification, Object Detection, and Hardware-Aware Deployment

Qualcomm AI Hub provides an end-to-end workflow for deploying machine learning models on Qualcomm devices, as demonstrated in a new tutorial that walks through MobileNet-V2 classification and YOLOv7 o…

19:30

2026-06-05

gilesthomas.com

machine-learning

JAX backends and devices

JAX defaults to loading data directly onto GPU memory when a CUDA-enabled version is installed, causing out-of-memory errors for large datasets that would fit in system RAM. The framework's `jax.devic…

23:30

2026-06-04

gilesthomas.com

machine-learning

Using Safetensors with Flax

A developer porting PyTorch LLM code to JAX using Flax encountered difficulties when attempting to store model checkpoints with Safetensors, as the library's Flax API expects flat dictionaries but Fla…

22:18

2026-06-04

leanpub.com

large-language-models

Leanpub Book LAUNCH 🚀 My Adventures with Large Language Models: Build foundational LLMs from Transformers to DeepSeek, from scratch, in PyTorch by Prathamesh S.

Prathamesh S. launched a Leanpub book titled 'My Adventures with Large Language Models' that teaches readers to build five LLM architectures from scratch in PyTorch, including GPT-2, Llama 3.2, and De…

04:00

2026-06-04

arxiv.org

autonomous-vehicles

StandardE2E: A Unified Framework for End-to-End Autonomous Driving Datasets

Researchers have released StandardE2E, a unified open-source framework that standardizes preprocessing and data loading across six major autonomous driving datasets, including Waymo and Argoverse. The…

20:39

2026-06-03

lesswrong.com

ai-safety

Aligning Superintelligent Humans

A new approach to AI alignment proposes keeping artificial superintelligence at a manageable capacity by boosting human intelligence and introspection through brain-computer interfaces, rather than tr…

// co-occurs with top 8 entities

NVIDIA (26), CUDA (24), JAX (15), Hugging Face (13), Python (12), TensorFlow (12), GPU (11), NumPy (9)