What’s new in serverless Managed Service for Apache Spark

wpnews.pro

cd /news/artificial-intelligence/whats-new-in-serverless-managed-serv… · home › topics › artificial-intelligence › article

[ARTICLE · art-20571] src=cloud.google.com ↗ pub=2026-06-03T16:00Z topic=artificial-intelligence verified=true sentiment=↑ positive

What’s new in serverless Managed Service for Apache Spark

Google Cloud announced the general availability of its serverless Managed Service for Apache Spark runtime version 3.0, which introduces zero-setup onboarding and reduces startup times by 75%. The update automates IAM permissions, networking, and API management to accelerate first workload launches, and includes Dynamic Workload Scheduler support for GPU availability. Customer use of the service for data science has nearly doubled year over year, reflecting the platform's expanded suitability for AI/ML workloads and SLA-sensitive batch pipelines.

read4 min views13 publishedJun 3, 2026

Whether you use it for data preparation, real-time interactive queries, AI model training, or something entirely different, running Apache Spark at scale is demanding — you shouldn’t have to manage the underlying infrastructure too.

Late last year, we announced the general availability (GA) of our serverless Managed Service for Apache Spark runtime version 3.0, prioritizing speed, simplicity, and reliability. Since then, customer use of Managed Service for Apache Spark for data science has nearly doubled year over year. This is a testament to our belief that using Google Cloud is the easier, smarter, and faster place to run your Apache Spark workloads.

In this blog, let’s dive into a few key features that make our serverless Apache Spark offering a great fit for a wide range of workflows, including feature engineering, GPU-accelerated model training and tuning, semantic search, RAG, building AI agents and applications, and more.

The most significant barrier to entry for a cloud service is often the "time to magic moment" — the interval between creating a project and running your first workload. Previously, with serverless Spark, you still needed to manually configure IAM roles, VPC networking, and firewall rules before submitting a single job.

In the serverless Spark 3.0 runtime version, zero-setup onboarding significantly reduces the time to launch your first workload on serverless Spark. It does so by automating the following steps:

Permissions: Necessary IAM roles and permissions are automatically provisioned to the appropriate service accounts.

Networking: Private Google Access is auto-enabled on subnets, and system firewall policies are configured automatically.

API management: Enabling APIs is now more efficient; you can just enable the Managed Service for Apache Spark API instead of manually having to enable several different APIs, as you did previously.

Latency matters, especially for interactive data science and SLA-sensitive batch pipelines. Historically, serverless Spark startup times could take several minutes. With the 3.0 runtime, we’ve dropped startup times by 75% across both standard and premium tiers, delivered automatically without any code or configuration changes and at no additional cost.

This massive improvement qualifies serverless Spark for a much broader range of SLA-sensitive workloads, and we’re always looking to optimize startup times even further.

"Serverless Spark allowed us to quickly reap benefits by removing the need for fine-grain machine management. This drove faster model development and significantly reduced our data processing costs." - César Narnajo, Principal Engineer, Moloco

Support for Dynamic Workload Scheduler (DWS) Flex Start Mode in the serverless 3.0 runtime version allows serverless Spark to queue customer requests for a configurable duration when GPUs are unavailable. This feature addresses the obtainability challenges for high-demand accelerators like NVIDIA A100 and L4 that are the subject of frequent regional shortages. By pausing workloads until the necessary GPU capacity becomes accessible with DWS, you can dramatically increase obtainability and reliability for your latency-sensitive AI/ML workloads.

The serverless Spark 3.0 runtime version supports current and upcoming Apache Spark 4.x innovations, including Spark Connect, which supports a decoupled client-server architecture that enables remote connectivity from any client.

To protect global enterprise workloads from zonal outages or hardware stockouts, the serverless Spark 3.0 runtime introduces enhanced multi-zonal support by default. The service can now automatically allocate execution nodes across multiple zones within a single region to help ensure obtainability.

Crucially, we do not charge for cross-zonal network traffic between nodes in a region, providing high availability without the traditional multi-zone tax. This is another benefit that you can realize by bringing your global Apache Spark workloads to Google Cloud.

In addition to the above, we’re also continuing to innovate and push the boundaries of ease of use in areas such as history-based autotuning and goal based autoscaling.

You can take advantage of these features today by specifying runtime_version: 3.0 in your batch workloads or interactive sessions. To run your first workload on serverless Spark, perform the following simple steps:

Enable the [Managed Service for Apache Spark API](https://console.cloud.google.com/flows/enableapi?apiid=dataproc).

If you aren’t the project owner, ask your project admin for the serverless Managed Service for Apache Spark [Editor ](https://docs.cloud.google.com/iam/docs/roles-permissions/dataproc#dataproc.serverlessEditor)(`roles/dataproc.serverlessEditor`

) role on the project.

Now you’re ready to start running your workloads on the Serverless 3.0 runtime version. For more details, visit our updated documentation and access serverless Managed Service for Apache Spark in the Google Cloud console.

source & further reading

cloud.google.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/whats-new-in-serverless-…

Read original on cloud.google.com → cloud.google.com/blog/products/data-analytics/se…

mentioned entities

Google Cloud

Apache Spark

Managed Service for Apache Spark

metadata

slugwhats-new-in-serverless-managed-service-for-apache-spark

topic#artificial-intelligence

secondary4 topics

sentimentpositive

canonicalcloud.google.com

navigation

← prevGround truth is a process, not a…

next →No, Artificial Intelligence Is N…

── more in #artificial-intelligence 4 stories · sorted by recency

cloud.google.com · 10 Jun · #artificial-intelligence

Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance

cloud.google.com · 4 Jun · #artificial-intelligence

What's new for Managed Service for Apache Spark clusters

runtimewire.com · 21 Jul · #artificial-intelligence

Cognition launches Devin Outposts, letting its AI agent run on private infrastructure

gritt.ai · 21 Jul · #artificial-intelligence

Gritt: AI to Build Infrastructure with Robotics

── more on @google cloud 3 stories trending now

wpnews · 26 May · #ai-agents

Think, Durable Objects, and the Real Shape of AI Applications

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 8 Jul · #ai-tools

What's the Future of Clay?

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required