{"slug": "synthesize-realistic-3d-medical-images-at-scale-to-ship-pre-trained-models", "title": "Synthesize Realistic 3D Medical Images at Scale to Ship Pre‑Trained Models", "summary": "NVIDIA released NV-Generate-MR-Brain, a new open-source model for synthetic 3D brain MRI generation, built on the MAISI architecture. The model was trained on MR-RATE, the world's largest open-source multimodal MRI dataset comprising 100,000 brain studies from over 83,000 patients, to address data scarcity and privacy restrictions in medical imaging AI development. The framework enables researchers to generate realistic 3D volumes with anatomical segmentations at scale for privacy-preserving data augmentation and model training.", "body_md": "High‑quality 3D medical imaging data is the foundation of modern radiology AI, but access to it is often constrained by data scarcity, privacy restrictions, and the high cost of expert annotation. As a result, training reliable 3D medical imaging models is frequently bottlenecked by small, narrow, and hard‑to‑share datasets, limiting model robustness and generalization.\n\nTo help teams overcome these challenges, NVIDIA introduced Medical AI for Synthetic Imaging (MAISI) in 2024—a state‑of‑the‑art generative model that synthesizes high‑resolution 3D CT volumes with pixel‑level anatomical segmentation for privacy‑preserving data augmentation and research.\n\nNV-Generate-CTMR, built on the MAISI architecture family, including MAISI‑v2 with Latent Rectified Flow, delivers an open source, end-to-end framework for synthetic CT and MRI generation. It enables researchers and developers to generate realistic 3D volumes and paired segmentations at scale, integrate them directly into training pipelines, and accelerate downstream medical imaging AI development.\n\nThis blog post introduces NV-Generate-MR-Brain, a new model for the synthetic generation of human brain anatomy and structure segmentation built on the MAISI architecture and extending it toward scalable, open workflows for synthetic 3D medical imaging generation.\n\n## Breaking the 3D medical imaging data bottleneck\n\nNV-Generate-MR-Brain was trained on the newly released multimodal MR-RATE [dataset](https://huggingface.co/datasets/Forithmus/MR-RATE) from University of Zurich, Medipol University Hospital, [Forithmus](https://forithmus.com) and NVIDIA. The MR-RATE dataset builds on the highly successful CT-RATE dataset and multimodal foundation models.\n\nMR-RATE, the world’s largest open source multimodal MRI dataset, comprises 100,000 brain MRI studies from more than 83,000 patients—totaling about 700,000 volumes—each paired with de‑identified radiology reports, clinical, and scanner Digital Imaging and Communications in Medicine (DICOM) metadata. The dataset was created to establish an open, large‑scale foundation for developing both research and commercial AI systems that understand both imaging and clinical context.\n\nMR‑RATE captures the diversity of real‑world neuroimaging practice, spanning different scanner types, imaging protocols, and neurological pathologies. The MR-RATE dataset is being released with an open CC-BY-NC license for research institutions with commercial licenses available through Forithmus.\n\n## Open source by design\n\nThe repository includes end‑to‑end inference code, pretrained weights, and training configurations, enabling teams to get started immediately without rebuilding complex pipelines from scratch. Users can generate synthetic images out of the box or fine‑tune the models on their own datasets to adapt to new anatomies, scanners, or modalities—significantly lowering both technical and compute barriers.\n\nFor this project all of the ingredients including code, data, and models are released with open source licenses with most models being released under the NVIDIA Open Model License. Inferencing for these models can be run on NVIDIA RTX GPUs royalty-free to generate images, fine-tune the model on new data, or new use cases.\n\n## Why image generation is essential for medical AI\n\nMedical image synthesis has rapidly become a core capability for medical AI development. Teams use synthetic data to augment limited training sets, translate between imaging modalities such as CT and MRI, simulate rare pathologies, and enable privacy‑preserving data sharing without exposing real patient information.\n\nBy generating realistic, anatomically consistent 3D volumes—often paired with segmentation labels—synthetic data helps models generalize better when labeled examples are scarce and enables consistent benchmarking across sites, scanners, and protocols.\n\nAs clinical imaging becomes increasingly personalized, heterogeneous, and multimodal, scalable and controllable generation frameworks are no longer optional—they are essential for building robust medical AI systems.\n\n## Limitations of existing medical image synthesis approaches\n\nOver the years, medical image synthesis methods have largely fallen into three categories: direct regression models, generative adversarial network (GAN)‑based approaches, and, more recently, diffusion models that generate images through iterative denoising.\n\nAmong these, diffusion models have emerged as the state of the art, offering improved stability and the ability to model complex anatomical distributions. However, applying diffusion models in real clinical workflows remains challenging.\n\nFirst, real‑world medical images vary widely across scanners, acquisition protocols, and voxel spacings, making it difficult for models trained on narrow datasets to generalize.\n\nSecond, CT and MRI are inherently 3D modalities, yet full 3D diffusion models are computationally expensive in both time and GPU memory.\n\nThird, even when conditioning signals—such as masks or anatomy hints—are provided, generated outputs may not faithfully follow those inputs, limiting their usefulness for controlled or task‑specific generation.\n\nTogether, these challenges—limited generalization, high computational cost, and weak condition alignment—make many existing approaches difficult to deploy at scale, motivating the need for faster and more controllable 3D synthesis frameworks.\n\n## Fast, Open Source 3D Medical Image Synthesis\n\nNV‑Generate‑CTMR is an open‑source framework from NVIDIA designed to make high‑quality 3D medical image synthesis practical for everyday research and development. Rather than treating generative modeling as a narrow, task‑specific solution, it provides a reproducible, ready‑to‑use platform for creating realistic CT and MRI volumes across a wide range of clinical scenarios.\n\nThe framework is the first open‑source medical image generation framework to support flexible voxel sizes, variable volume dimensions, and whole‑body coverage within a single model (shown in Figure below).\n\nThis flexibility allows researchers to synthesize data that matches real clinical protocols—from small, cropped regions to full‑resolution, large‑field‑of‑view scans—without retraining separate models for each setting. In this sense, NV‑Generate‑CTMR behaves as a foundation model for medical imaging, adaptable to many downstream tasks and anatomies rather than being limited to a single organ or configuration.\n\n## Efficient, sustainable AI development\n\nBy sharing models and training details openly, NV‑Generate‑CTMR follows the same philosophy as other open‑source foundation models: reuse instead of retrain.\n\nFine‑tuning an existing model is faster and far more energy‑efficient than training from scratch, reducing development time, lowering electricity consumption, and shrinking environmental impact.\n\n## Under the hood\n\nNV‑Generate‑CTMR contains two model architectures:\n\n**MAISI‑v1**, based on Latent Denoising Diffusion Probabilistic Models (DDPM) for stochastic image generation with better diversity** MAISI‑v2**, based on** Latent Rectified Flow**, for 33x acceleration in inference speed and image generation with better quality\n\nDetails were published in two technical papers: [MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-Specific Contrastive Loss](https://research.nvidia.com/publication/2025-08_maisi-v2-accelerated-3d-high-resolution-medical-image-synthesis-rectified-flow) at the AAAI Conference on Artificial Intelligence in 2026; and [MAISI: Medical AI for Synthetic Imaging](https://arxiv.org/abs/2409.11169) at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) in 2025.\n\n## Fast inference at scale\n\nThe MAISI‑v2 model in NV‑Generate‑CTMR achieves state‑of‑the‑art image quality with much faster inference compared to prior medical image generation methods, while delivering inference speeds comparable to leading video generation models. Table 1, below, breaks down the family of NV-Generate models.\n\nModel Name | ddpm-ct | rflow-ct | rflow-mr | NV-Generate-MR-Brain |\n| Modality | CT | CT | MR | MR |\nRelease date | August 2024 | March 2025 | October 2025 | March 2026 |\nBody Region | Whole body | Whole body | Brain, prostate, abdomen, breast | Whole brain, skull-stripped brain (user can specify) |\n| Architecture | MAISI-v1 | MAISI-v2 | MAISI-v2 | MAISI-v2 |\nInference step | 1000 | 30 | 30 | 30 |\n| Max Volume | 512x512x768 | 512x512x768 | 512x512x128 | 512x512x256 |\nUse case | Image-only generation; image/mask pair generation | Image-only generation; image/mask pair generation | Image-only generation | Image-only generation; cross-contrast generation |\nAdvantages | Better image diversity, whole body coverage | Fast inference speed, better image quality, whole body coverage | Fast inference speed, multiple body region coverage | Fast inference speed, better image quality for brain region |\nLicense | Open source, Commercial friendly | Open source, Commercial friendly | Open source, Research Only | Open source, Commercial friendly |\n\n*Table 1. NV-Generate family of models*## Multi-contrast generation model for brain MRI\n\nBrain MRI is one of the most widely used applications of magnetic resonance imaging. To support this domain, we released NV-Generate-MR-Brain, a generative model built on the MAISI‑v2 architecture and trained on the newly released MR‑RATE dataset.\n\nThe model is designed for high-fidelity brain MRI synthesis and includes a foundation brain MRI model capable of generating either whole-brain or skull‑stripped images based on user specifications. It supports several widely used sequences and contrasts, including T1‑weighted (T1w), T2‑weighted (T2w), FLAIR, and SWI, enabling realistic and flexible image generation for research and clinical training applications.\n\nThe model supports high-resolution volumetric synthesis up to 512 × 512 × 256, approaching the upper range of spatial resolution used in clinical and research brain MRI, enabling realistic full-volume data generation for medical imaging workflows.\n\nIn addition, NV-Generate-MR-Brain provides a ControlNet module for generation of specified anatomical structures or cross-sequence synthesis, enabling users to predict one MRI sequence based on another.\n\n## Real‑world applications and research adoption\n\nImage–mask pairs with tumors generated by NV‑Generate‑CTMR have been used as augmented training data for [ NV Segment](https://huggingface.co/nvidia/NV-Segment-CTMR). Beyond NVIDIA, the models have been used or fine‑tuned by external researchers across a wide range of applications, including:\n\n[Zero-shot anomaly detection](https://openaccess.thecvf.com/content/CVPR2025/papers/Jiang_Advancing_Generalizable_Tumor_Segmentation_with_Anomaly-Aware_Open-Vocabulary_Attention_Maps_and_CVPR_2025_paper.pdf)[Lung CT cancer classification](https://arxiv.org/pdf/2512.18038)[Prostate MRI lesion classification](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11316262)[MR-to-CT synthesis](https://arxiv.org/pdf/2512.23894)[Text prompted CT and MRI tumor segmentation](https://openaccess.thecvf.com/content/CVPR2025/papers/Jiang_Advancing_Generalizable_Tumor_Segmentation_with_Anomaly-Aware_Open-Vocabulary_Attention_Maps_and_CVPR_2025_paper.pdf)[Brain diffusion MRI tractography](https://arxiv.org/pdf/2511.13183)[Brain tumor MRI synthesis](https://arxiv.org/pdf/2510.09365?)[Text-to-CT generation](https://arxiv.org/pdf/2509.14780?)[Text-to-brain MRI generation](https://arxiv.org/pdf/2509.05978?)\n\n“Synthetic, anatomically realistic neuro MR data from NV-Generate, combined with automated segmentation from NV-Segment and clinical reasoning capabilities from NV-Reason, help us design and validate AI solutions more efficiently,” said Ioannis Panagiotelis, PhD, Business Leader MR at Philips. “This enables radiologists to benefit from smarter, more explainable workflows without compromising patient privacy.”\n\n## Try it yourself: Synthesize 3D medical images\n\nThe fastest way to experience NV-Generate-CTMR is to run it yourself.\n\n**Online demo:** No GPU required, you can explore an interactive browser demo hosted by NVIDIA.\n\n**Command Line Interface (CLI):** The online demo showcases core capabilities, but the full experience is available from the GitHub repository, which includes pretrained weights and ready-to-use inference scripts for generating complete 3D CT or MRI volumes locally. After cloning the repo and installing dependencies, you can launch inference with a single command:\n\n```\ngit clone https://github.com/NVIDIA-Medtech/NV-Generate-CTMR.git\n\ncd NV-Generate-CTMR\nexport MONAI_DATA_DIRECTORY=\"./temp_work_dir\"\nnetwork=\"rflow\"\ngenerate_version=\"rflow-ct\"\npython -m scripts.inference \\\n  -t ./configs/config_network_${network}.json \\\n  -i ./configs/config_infer.json \\\n  -e ./configs/environment_${generate_version}.json \\\n  --random-seed 0 \\\n  --version ${generate_version}\n```\n\nThis command loads the pretrained rectified flow model and synthesizes full 3D medical volumes directly to your local workspace. You can then visualize the outputs, inspect paired segmentation masks, or plug the generated data into your own training and evaluation pipelines. An example result of the code block above is shown in Figure 4, below.\n\n## Example results\n\nWhether you’re testing ideas, augmenting datasets, or benchmarking models, NV-Generate-CTMR makes it easy to start generating realistic medical images right away.\n\n## Getting started\n\nCode: [https://github.com/NVIDIA-Medtech/NV-Generate-CTMR](https://github.com/NVIDIA-Medtech/NV-Generate-CTMR)\n\nDataset (research access): [https://huggingface.co/datasets/Forithmus/MR-RATE](https://huggingface.co/datasets/Forithmus/MR-RATE)\n\nModel checkpoints:", "url": "https://wpnews.pro/news/synthesize-realistic-3d-medical-images-at-scale-to-ship-pre-trained-models", "canonical_source": "https://developer.nvidia.com/blog/synthesize-realistic-3d-medical-images-at-scale-to-ship-pre-trained-models/", "published_at": "2026-05-22 16:00:00+00:00", "updated_at": "2026-05-29 08:08:02.625636+00:00", "lang": "en", "topics": ["generative-ai", "computer-vision", "artificial-intelligence", "machine-learning", "ai-research"], "entities": ["NVIDIA", "MAISI", "NV-Generate-CTMR", "NV-Generate-MR-Brain", "MAISI-v2"], "alternates": {"html": "https://wpnews.pro/news/synthesize-realistic-3d-medical-images-at-scale-to-ship-pre-trained-models", "markdown": "https://wpnews.pro/news/synthesize-realistic-3d-medical-images-at-scale-to-ship-pre-trained-models.md", "text": "https://wpnews.pro/news/synthesize-realistic-3d-medical-images-at-scale-to-ship-pre-trained-models.txt", "jsonld": "https://wpnews.pro/news/synthesize-realistic-3d-medical-images-at-scale-to-ship-pre-trained-models.jsonld"}}