{"slug": "minimax-m3-launches-on-nvidia-platform-with-free-endpoint", "title": "MiniMax M3 launches on NVIDIA platform with Free Endpoint", "summary": "MiniMax released its M3 multimodal model on NVIDIA's accelerated infrastructure, offering a free public endpoint via NVIDIA's API catalog. The 428-billion-parameter model processes text, images, and video with a one-million-token context window, using MiniMax Sparse Attention to reduce computational costs and improve speed. The release targets enterprise developers needing scalable, long-context AI for coding, video analysis, and design workflows.", "body_md": "MiniMax M3, a new multimodal model developed by MiniMax, is now available on NVIDIA’s accelerated infrastructure and supports advanced processing of text, images, and video. With 428 billion parameters and a context window of up to one million tokens, the model is engineered for long-context reasoning and complex workflows such as extended coding, video analysis, and design tasks.\n\nThe system’s architecture uses MiniMax Sparse Attention, reducing computational overhead and enabling substantially faster prefill and decoding than its predecessor. It trains natively on multimodal data from the outset, setting it apart from models that add these capabilities after initial training.\n\nThis release targets enterprise developers and organizations seeking to streamline AI application pipelines. MiniMax M3 can be deployed publicly via NVIDIA’s API catalog, with support for leading inference engines such as TensorRT LLM, SGLang, and vLLM. The model’s precision formats (BF16 and MXFP8) and support for up to 128 experts per token optimize performance on NVIDIA hardware, particularly Blackwell GPUs.\n\n**TestingCatalog POV 👀**\n\nMiniMax M3 on NVIDIA is a good chance for everyone to test the model for free. It is especially useful if you want to run a weekend project or save tokens for your 24/7 agents, such as OpenClaw or Hermes.\n\nEarly users and technical experts have noted the considerable efficiency gains and the ability to handle large-scale, multimodal workloads natively, putting MiniMax M3 in direct competition with other large language models in the market. The company’s collaboration with NVIDIA underscores a commitment to scalable, production-grade AI solutions for demanding enterprise environments.", "url": "https://wpnews.pro/news/minimax-m3-launches-on-nvidia-platform-with-free-endpoint", "canonical_source": "https://www.testingcatalog.com/minimax-m3-launches-on-nvidia-platform-with-free-endpoint/", "published_at": "2026-06-12 17:51:29+00:00", "updated_at": "2026-06-12 18:10:03.500934+00:00", "lang": "en", "topics": ["artificial-intelligence", "large-language-models", "ai-infrastructure", "ai-products", "generative-ai"], "entities": ["MiniMax", "NVIDIA", "MiniMax M3", "TensorRT LLM", "SGLang", "vLLM", "Blackwell GPUs", "OpenClaw"], "alternates": {"html": "https://wpnews.pro/news/minimax-m3-launches-on-nvidia-platform-with-free-endpoint", "markdown": "https://wpnews.pro/news/minimax-m3-launches-on-nvidia-platform-with-free-endpoint.md", "text": "https://wpnews.pro/news/minimax-m3-launches-on-nvidia-platform-with-free-endpoint.txt", "jsonld": "https://wpnews.pro/news/minimax-m3-launches-on-nvidia-platform-with-free-endpoint.jsonld"}}