Polars GPU Engine

wpnews.pro

cd /news/ai-infrastructure/polars-gpu-engine · home › topics › ai-infrastructure › article

[ARTICLE · art-24515] src=docs.rapids.ai ↗ pub=2026-06-11T21:40Z topic=ai-infrastructure verified=true sentiment=· neutral

Polars GPU Engine

NVIDIA's cuDF library now provides GPU-accelerated execution engines for the Polars Lazy API, enabling users to run dataframe operations on GPUs with automatic fallback to CPU when operations are unsupported. The engine, available in Open Beta through the cudf-polars package, delivers significant speedups for datasets exceeding hundreds of gigabytes and supports scaling from single GPU to multi-GPU and multi-node configurations.

read2 min views27 publishedJun 11, 2026

cuDF provides GPU-accelerated execution engines for Python users of the Polars Lazy API. The engines support most of the core expressions and data types as well as a growing set of more advanced dataframe manipulations and data file formats. When a GPU engine is selected, Polars converts expressions into an optimized query plan and determines whether the plan is supported on the GPU. If it is not, the execution transparently falls back to the standard Polars engine and runs on the CPU.

Install# #

Follow the RAPIDS installation guide and pick the cudf-polars

package for your CUDA and Python versions. For example, with conda:

conda install -c rapidsai -c conda-forge -c nvidia cudf-polars

Or with pip (CUDA 13 wheels; use cudf-polars-cu12

for CUDA 12):

pip install cudf-polars-cu13

Quick start# #

RayEngine with no arguments uses every GPU visible to the process, so the same code runs on one GPU and scales to multi-GPU / multi-node setups automatically:

import polars as pl
from cudf_polars.engine.ray import RayEngine

query = (
    pl.scan_parquet("/data/dataset/*.parquet")
    .filter(pl.col("amount") > 100)
    .group_by("customer_id")
    .agg(pl.col("amount").sum())
)

with RayEngine() as engine:
    result = query.collect(engine=engine)

See Usage for the full tutorial, Engines for a conceptual overview of the available engines, Configuration Options for the StreamingOptions configuration, and

Understanding Memory Use in the GPU Streaming Enginefor guidance on out-of-memory errors and memory tuning.

Benchmark# #

Polars delivers high performance across a wide range of data scales through multiple execution engines. The default CPU engine is highly optimized for interactive and medium-scale analytics on a single node. The Polars GPU engine lets you move seamlessly to GPU nodes, providing meaningful acceleration when your dataset grows to hundreds of gigabytes or more.

We ran the Polars Decision Support (PDS) benchmarks to compare the Polars GPU engine with the CPU engine at larger scale factors to show how the GPU engine delivers meaningful speedups as dataset size grows:

On a single GPU, you can run TB-scale workloads with significant speedups compared to running on CPU. You can also scale up to run on multiple GPUs for processing even larger workloads:

For more information on the benchmarks being run, see the PDS-DS queries in the cuDF GitHub repository.

Learn More# #

The GPU engine for Polars is now available in Open Beta and the engine is undergoing rapid development. To learn more, visit the GPU Support page on the Polars website.

source & further reading

docs.rapids.ai — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/polars-gpu-engine

Read original on docs.rapids.ai → docs.rapids.ai/api/cudf/stable/cudf_polars/

mentioned entities

cuDF

Polars

RAPIDS

CUDA

RayEngine

metadata

slugpolars-gpu-engine

topic#ai-infrastructure

secondary2 topics

sentimentneutral

canonicaldocs.rapids.ai

navigation

← prevHacking Google with A.I. For $50…

next →Deep learning four decades of hu…

── more in #ai-infrastructure 4 stories · sorted by recency

arxiv.org · 9 Jul · #ai-infrastructure

Breaking Database Lock-In: Agentic Regeneration of Storage Readers for Databases

cryptobriefing.com · 28 Jul · #ai-infrastructure

Visa deploys Anthropic’s Claude Mythos to hunt vulnerabilities across its global payment network

dev.to · 28 Jul · #ai-infrastructure

AgentENV: Distributed Runtime for AI Agents at Scale (Open Source, Rust)

blogs.cisco.com · 28 Jul · #ai-infrastructure

A High-IQ Network Isn’t a Smart Network. Here’s What’s Missing.

── more on @cudf 3 stories trending now

wpnews · 26 Jul · #artificial-intelligence

Nobel laureate Simon Johnson on the AI race and China’s ‘over-automation’ problem

wpnews · 26 Jul · #artificial-intelligence

China’s Moonshot, Z.AI, and DeepSeek are challenging U.S. AI labs—and beating them on cost

wpnews · 26 Jul · #ai-safety

University of Washington study reveals prompt injection risks lurking in AI agent memory

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required