{"slug": "nvidia-cuda-13-3-rolls-out-cuda-python-1-0-cuda-tile-for-c", "title": "Nvidia CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile for C++", "summary": "NVIDIA released CUDA 13.3 on Tuesday, marking the CUDA Python 1.0 milestone as a stable, supported release for leveraging CUDA in Python applications for AI, data science, and scientific computing. The update also introduces CUDA Tile for C++ and the CompileIQ compiler auto-tuning framework, which can deliver up to 15% speed-ups on kernels like GEMM and attention.", "body_md": "# NVIDIA CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile For C++\n\nNVIDIA on Tuesday released CUDA 13.3 as another significant advancement for their unified GPU programming stack for NVIDIA hardware.\n\nFor those wanting to tap the power of CUDA from the Python programming language, CUDA 13.3 marks the CUDA Python 1.0 milestone as a stable, supported means of being able to leverage CUDA in Python apps for AI, data science, scientific computing, and related uses.\n\nFor C++ fans, CUDA 13.3 brings CUDA Tile for C++ in bringing the\n\nIn addition to these programming enhancements, CUDA 13.3 also introduces the CompileIQ compiler auto-tuning framework that can provide up to 15% speed-ups on kernels like GEMM and attention.\n\nCUDA 13.3 also brings a Numba CUDA MLIR back-end , various math library updates, C++23 support in the NVCC and NVRTC code, mmap() support, and other improvements.\n\nMore details on this CUDA 13.3 feature update via the\n\nFor those wanting to tap the power of CUDA from the Python programming language, CUDA 13.3 marks the CUDA Python 1.0 milestone as a stable, supported means of being able to leverage CUDA in Python apps for AI, data science, scientific computing, and related uses.\n\nFor C++ fans, CUDA 13.3 brings CUDA Tile for C++ in bringing the\n\n[CUDA Tile](https://www.phoronix.com/search/CUDA+Tile)programming model to the C++ world.In addition to these programming enhancements, CUDA 13.3 also introduces the CompileIQ compiler auto-tuning framework that can provide up to 15% speed-ups on kernels like GEMM and attention.\n\nCUDA 13.3 also brings a Numba CUDA MLIR back-end , various math library updates, C++23 support in the NVCC and NVRTC code, mmap() support, and other improvements.\n\nMore details on this CUDA 13.3 feature update via the", "url": "https://wpnews.pro/news/nvidia-cuda-13-3-rolls-out-cuda-python-1-0-cuda-tile-for-c", "canonical_source": "https://www.phoronix.com/news/NVIDIA-CUDA-13.3-Released", "published_at": "2026-05-27 21:21:23+00:00", "updated_at": "2026-05-27 21:44:31.673868+00:00", "lang": "en", "topics": ["ai-infrastructure", "ai-tools", "machine-learning", "artificial-intelligence"], "entities": ["NVIDIA", "CUDA 13.3", "CUDA Python 1.0", "CUDA Tile", "CompileIQ", "Numba", "NVCC", "NVRTC"], "alternates": {"html": "https://wpnews.pro/news/nvidia-cuda-13-3-rolls-out-cuda-python-1-0-cuda-tile-for-c", "markdown": "https://wpnews.pro/news/nvidia-cuda-13-3-rolls-out-cuda-python-1-0-cuda-tile-for-c.md", "text": "https://wpnews.pro/news/nvidia-cuda-13-3-rolls-out-cuda-python-1-0-cuda-tile-for-c.txt", "jsonld": "https://wpnews.pro/news/nvidia-cuda-13-3-rolls-out-cuda-python-1-0-cuda-tile-for-c.jsonld"}}