cd /news/machine-learning/braing3n-a-dual-purpose-tokenizer-fo… · home topics machine-learning article
[ARTICLE · art-33529] src=arxiv.org ↗ pub= topic=machine-learning verified=true sentiment=↑ positive

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

Researchers introduced BrainG3N, a dual-purpose tokenizer for 3D brain MRI generation that decouples encoder and decoder using a masked-autoencoder, enabling both clinically informative embeddings and anatomically faithful reconstruction. The tokenizer outperformed or matched state-of-the-art models on 21 of 23 clinical tasks and supported controllable generation and longitudinal forecasting, addressing the trade-off between clinical utility and generative fidelity.

read1 min views1 publishedJun 19, 2026

arXiv:2606.19651v1 Announce Type: new Abstract: Three-dimensional (3D) brain MRI is central to clinical neurology and neuro-oncology, where generative models could augment under-represented cohorts, simulate disease trajectories, and support privacy-preserving data sharing. Latent diffusion has been the go-to solution for modeling imaging data, but it places two competing demands on the tokenizer: encoder embeddings must retain the clinical information that downstream tasks act on, and the decoder must reconstruct anatomically faithful volumes. Existing reconstruction-driven tokenizers achieve the second at the expense of the first. To address this, we introduce a fully volumetric masked-autoencoder (MAE) based tokenizer for 3D brain MRI latent diffusion, decoupling encoder and decoder: a frozen 3D MAE encoder produces clinically informative embeddings, while a dedicated CNN decoder reconstructs voxels from a linear projection of those embeddings. We pretrain the encoder on 35,309 volumes from 18 public cohorts spanning four modalities, ten disease categories, and 200+ acquisition sites, and demonstrate its dual utility in two settings. First, on a 23-task linear-probing benchmark, the encoder outperforms or matches SOTA models (i.e., BrainIAC, BrainSegFounder, and MedicalNet) on 21 of 23 tasks. Second, a conditional diffusion transformer (DiT) trained on these clinically informative embeddings supports both conditional generation across six variables and patient-specific longitudinal forecasting. Together these results establish a single 3D brain-MRI embedding space capable of both downstream clinical tasks and controllable generation.

── more in #machine-learning 4 stories · sorted by recency
── more on @braing3n 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/braing3n-a-dual-purp…] indexed:0 read:1min 2026-06-19 ·