{"slug": "whats-new-in-the-ai-platform-agents-for-ml-engineering-our-deep-learning-and-new", "title": "What’s New in the AI Platform: Agents for ML Engineering, Our Deep Learning Platform, and New Capabilities for Real-Time ML", "summary": "Databricks announced new AI platform capabilities at Data + AI Summit 2026, including Genie Code for the full ML lifecycle, AI Runtime for large-scale deep learning training, and Feature and Model Serving for real-time ML. These tools aim to accelerate AI development from experimentation to production.", "body_md": "Use Genie Code to accelerate the full ML Lifecycle, AI Runtime to train large-scale deep learning models, and Feature and Model Serving to power real-time ML at scale.\n\nby [Tejas Sundaresan](/blog/author/tejas-sundaresan) and [Mike Del Balso](/blog/author/mike-del-balso)\n\nThere’s never been a more dynamic, exciting time to be building your own AI models and systems. From demand forecasting and fraud detection to search, recommendations, personalization, and multimodal AI, machine learning is powering critical applications across every industry.\n\nAt Data + AI Summit 2026, we're thrilled to announce the following new capabilities within the Databricks AI Platform:\n\nTogether, these capabilities streamline the path from experimentation to production, enabling organizations to build, deploy, and scale AI applications an significantly faster than ever before.\n\nLet's take a closer look at what's new.\n\nToday, bringing an ML model to production can take months, with teams spending countless hours on repetitive tasks across the ML lifecycle—from feature engineering and experiment management to model evaluation, and deployment. But agents have transformed how engineering and technical teams operate. To that end, at DAIS this year, we’re excited to announce [Genie Code](https://www.databricks.com/blog/introducing-genie-code)’s support for the entire ML lifecycle.\n\nBuilding and operating ML models takes nuanced decisions that generic coding agents can't make. Can I rely on this dataset’s freshness and quality as a feature? Will this feature leak future information into the model? Is this serving endpoint starting to drift? Getting the details in ML right requires deep context, and that context only comes from tight integration with the data and ML platform: your data and its quality, feature lineage, experiment history, training infrastructure, and production performance.\n\nThat's where Genie Code comes in:\n\nAnd so, with Genie Code, your ML teams can move faster than ever before.\n\nGenie Code handles feature engineering the way your senior ML engineer would—learning your team's existing patterns, reusing proven transformations, and building features that are consistent with what's already in production.\n\nGenie Code doesn't just write ML code—it trains and tunes production-grade models. It automatically selects and configures the right infrastructure, whether that's CPU for lightweight experiments or GPU for distributed training, and logs every run natively to MLflow.\n\nGenie Code takes models from notebook to production in one flow—registering to Unity Catalog, deploying to a serving endpoint, and keeping governance intact every step of the way.\n\nGenie Code has completely changed how I work. I run upwards of 15 parallel threads scoped to different notebooks and assets every day, and managing all of that across tabs is one of the biggest sources of friction in my workflow. Full page Genie Code with concurrent sessions would give me a true workspace for running everything in parallel without constantly losing context.— Moritz Schiek, Solution Consultant, Bosch\n\nWith Genie Code, we moved from raw data to a governed, production-ready ML workflow in 90 minutes. Because it uniquely understands production ML workflows on Databricks, it helped us create Delta tables, explore the data, train and compare models, register them with MLflow and Unity Catalog, and deploy the champion model to a serving endpoint, with time left to optimize for the business outcome that mattered most.— Radu Dragusin, Principal Engineer, Data & AI, Danfoss\n\nTo learn more about Genie Code, please get started [here](https://docs.databricks.com/aws/en/genie-code/)!\n\nGPUs power today’s most advanced AI workloads—from forecasting and recommendations to multimodal foundation models. But deep learning teams struggle to procure and manage GPU infrastructure, configure distributed training environments, and resolve performance bottlenecks. They prefer to focus on modeling instead of infrastructure.\n\nIn March, we launched a preview of AI Runtime, and today, we’re excited to share, as part of Data AI Summit, that AI Runtime now supports high performance multinode training. With [AI Runtime](https://www.databricks.com/blog/introducing-ai-runtime-scalable-serverless-nvidia-gpus-databricks-training-and-finetuning), Databricks users now have:\n\nWith this launch, Databricks customers can now leverage the same research-grade GPU platform our own team used to power training of foundation models like [DBRX](https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm) and [KARL](https://www.databricks.com/blog/meet-karl-faster-agent-enterprise-knowledge-powered-custom-rl). Today, AI Runtime now powers frontier workloads for hundreds of Databricks customers – helping bring state-of-the-art AI from research into production enterprise applications.\n\nAttach Serverless A10 and H100 GPUs to your notebook in 2–3 clicks. No cluster management required; only pay for what you use.\n\nUse Genie Code to help resolve performance bottlenecks, experiment with new architectures, or debug tricky bugs around model convergence or cryptic framework errors.\n\nAI Runtime is a production-grade platform for accelerated computing. Develop your deep learning code in interactive notebooks, and then use the full power of Lakeflow to submit and orchestrate jobs on GPU compute.\n\nDatabricks' AI Runtime greatly streamlined the process of training a custom Text To Formula (TTF) model. With no infrastructure setup or delays, it was easy to choose the right compute based on prompt size and output token generation. This allowed us to move quickly, maintain our Lakehouse workflows, and deliver a high-quality model with full governance, reducing time to setup, train and deploy our model from days to hours.— Nikhil Sunderraj, Principal Machine Learning Engineer, FactSet Research Systems, Inc.\n\nTo get started training your next model on GPUs, please see our [examples and documentation](https://docs.databricks.com/aws/en/machine-learning/ai-runtime/) here!\n\nThe most impactful machine learning applications operate in real time: serving recommendations in milliseconds, stopping fraudulent transactions before they're approved, and delivering search results that feel instantaneous.\n\nDeploying a model to production is a delicate balance: every request needs to complete within a few milliseconds, even when traffic spikes – but your costs should stay low when traffic is quiet. Keeping that balance at scale has historically been as hard as building the model itself. Under high QPS, serving infrastructure becomes the bottleneck. Latency grows unpredictable, costs climb, and teams burden their best engineers with re-tuning replica counts, concurrency limits, and autoscaling thresholds every time a model or its traffic shifts.\n\nAt Data + AI Summit, we're announcing new capabilities that eliminate that burden – and streamlines achieving low latency, high QPS serving on Databricks:\n\nCustomers running on Databricks Model Serving have cut infrastructure costs by up to 90%+ versus self-managed stacks, improved p99 and p50 latency by up to 2x, and scaled past 100K QPS in production with little to no maintenance, all with enterprise grade reliability and availability. Leading ML teams like [Grammarly](https://www.databricks.com/blog/how-superhuman-and-databricks-built-200k-qps-inference-platform-together), [GoGuardian](https://www.databricks.com/blog/best-practices-high-qps-model-serving-databricks), and thousands of other customers rely on Databricks to serve their real-time ML systems.\n\nFor your next AI model, please give these new features a try! Learn more in the documentation or our detailed walkthrough blog posts:\n\nSee the AI Platform in action and learn how leading organizations are building and deploying AI models at scale at Data + AI Summit 2026.\n\nSubscribe to our blog and get the latest posts delivered to your inbox.", "url": "https://wpnews.pro/news/whats-new-in-the-ai-platform-agents-for-ml-engineering-our-deep-learning-and-new", "canonical_source": "https://www.databricks.com/blog/whats-new-ai-platform-agents-ml-engineering-our-deep-learning-platform-and-new-capabilities", "published_at": "2026-06-17 08:44:14+00:00", "updated_at": "2026-06-17 13:27:49.535674+00:00", "lang": "en", "topics": ["machine-learning", "ai-infrastructure", "ai-tools", "large-language-models", "generative-ai"], "entities": ["Databricks", "Genie Code", "AI Runtime", "Feature and Model Serving", "Unity Catalog", "MLflow", "Bosch", "Danfoss"], "alternates": {"html": "https://wpnews.pro/news/whats-new-in-the-ai-platform-agents-for-ml-engineering-our-deep-learning-and-new", "markdown": "https://wpnews.pro/news/whats-new-in-the-ai-platform-agents-for-ml-engineering-our-deep-learning-and-new.md", "text": "https://wpnews.pro/news/whats-new-in-the-ai-platform-agents-for-ml-engineering-our-deep-learning-and-new.txt", "jsonld": "https://wpnews.pro/news/whats-new-in-the-ai-platform-agents-for-ml-engineering-our-deep-learning-and-new.jsonld"}}