[ARTICLE · art-18911] src=arena.ai pub= topic=ai-products verified=true sentiment=↑ positive

Multimodal Max

Arena’s model router Max, powered by over 5 million community votes, is now multimodal and available as the default chat option across all modalities. The updated system supports search, vision, image generation, image editing, and front-end coding, with latency controls designed to maintain fast performance. Benchmarks show Max achieving Pareto frontier performance across most supported arenas, outperforming its routing set in text, search, vision, code, and text-to-image tasks while offering significant speed improvements.

4 min read · published May 5, 2026

Max, Arena's model router powered by 5M+ community votes, is now multimodal. Starting today, Max will be available as the default option in direct chat for all modalities, with expanded capabilities including search, vision, image generation, image editing, and front-end coding. Similar to our original Max for text, the multimodal variants are latency-controlled to provide a fast and performant experience. Try it now at arena.ai/max!

The benchmarks below show Max's performance across the Arena leaderboards most relevant to its capabilities. Because Max is a router, the models we compare Max to reflect a point-in-time snapshot of which models were publicly available and routable when Max was last trained and evaluated. Max is updated periodically to incorporate the latest frontier models.

Max holds Pareto frontier performance when compared to its routing set across every modality it covers. It outranks all other models in this set for every supported arena except Single-Image Edit and Multi-Image Edit, where it places second. In these two arenas, Max offers a large latency benefit over the top model.
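To make the Pareto frontier claim concrete, here is a minimal sketch of how such a frontier could be computed over (score, latency) pairs. The model names and numbers are hypothetical placeholders, not Arena leaderboard values, and the article does not describe how Arena computes its own frontier.

```python
from typing import NamedTuple


class ModelPoint(NamedTuple):
    name: str
    score: float      # leaderboard-style strength, higher is better
    latency_s: float  # latency in seconds, lower is better


def pareto_frontier(points):
    """Return the points that no other point dominates.

    q dominates p when q is at least as good on both axes
    and strictly better on at least one of them.
    """
    frontier = []
    for p in points:
        dominated = any(
            q.score >= p.score
            and q.latency_s <= p.latency_s
            and (q.score > p.score or q.latency_s < p.latency_s)
            for q in points
        )
        if not dominated:
            frontier.append(p)
    return sorted(frontier, key=lambda m: m.latency_s)


# Hypothetical values for illustration only.
points = [
    ModelPoint("router", 1500.0, 4.0),
    ModelPoint("model-a", 1497.0, 25.0),
    ModelPoint("model-b", 1450.0, 3.0),
    ModelPoint("model-c", 1440.0, 10.0),
]
frontier_names = [m.name for m in pareto_frontier(points)]
```

In this toy data the router sits on the frontier because no other model is both stronger and faster, while model-a is dominated: the router is stronger and faster than it.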

Top 5 models selected across all text modality prompts for Max

Max demonstrates strong performance on text, improving the time-to-first-token by more than 9 seconds compared to the next best model. The routing distribution is diverse, suggesting Max’s ability to leverage the various strengths of different models.
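A "top 5 models selected" breakdown like the ones captioned in this post could be derived from a log of routing decisions roughly as follows. This is a sketch under assumptions: the log format, the function name `top_k_routing_shares`, and the model names are all hypothetical and not taken from Arena's implementation.

```python
from collections import Counter


def top_k_routing_shares(routed_models, k=5):
    """Return the k most-selected models with their share of all prompts."""
    counts = Counter(routed_models)
    total = sum(counts.values())
    return [(name, count / total) for name, count in counts.most_common(k)]


# Hypothetical routing log: one entry per prompt, naming the model chosen.
routing_log = (
    ["model-a"] * 62
    + ["model-b"] * 20
    + ["model-c"] * 10
    + ["model-d"] * 5
    + ["model-e"] * 2
    + ["model-f"] * 1
)
shares = top_k_routing_shares(routing_log)
```

A flat list of shares indicates a diverse routing distribution, while one dominant entry indicates a concentrated one.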

Top 5 models selected across all search modality prompts for Max

In Search, we find similar results, with Max achieving top performance on the leaderboard. The routing distribution is concentrated on fewer models, each of which is strong in both performance and latency.

Top 5 models selected across all vision modality prompts for Max

In Vision Arena, Max outperforms the best model at the time by 3 points while providing more than a 20 second speedup. The routing distribution shows Max strongly relying on gpt-5.2-chat-latest for 62% of battle prompts, but the 38% of prompts routed elsewhere netted Max a 12 point gain in strength.

Top 5 models selected across all code modality prompts for Max

In Code Arena, Max once again beats out its routing choices in terms of performance. Here we focused on making Max faster as measured by end-to-end latency since the output is only presented to the user upon full completion. Max leans more on claude-opus-4-5 variants than might be expected, largely due to gains in e2e latency.

Top 5 models selected across all text-to-image modality prompts for Max

Max in the Text-to-Image modality had an extremely strong performance, outperforming the top models in its routing set both on model strength and latency. The routing distribution leaned towards gemini-3.1-flash-image-preview, but the remaining routing choices were diversely spread through multiple models.

Top 5 models selected across all image-edit modality prompts for Max

In the Single-Image Edit Arena, Max performed well, providing a faster but still strong alternative to gpt-image-2 (medium). Because the strength-latency tradeoff was configured to have a heavier emphasis on strength, Max still heavily relied on the top model. Interestingly, the routing distribution for Max in Image Edit Arena consisted largely of other models on the Pareto frontier, showing Max is able to identify models with the strongest latency/performance tradeoff.

Top 5 models selected across all multi-image-edit modality prompts for Max

Finally, on Multi-Image Edit Arena, Max also landed as a faster but still strong alternative to gpt-image-2 (medium). In this case, the strength-latency tradeoff was more heavily weighted towards speed, giving Max a 22 second speedup over gpt-image-2 (medium).
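The two image-edit results differ mainly in how the strength-latency tradeoff was weighted. One simple way to picture such a weighting is a per-prompt utility that subtracts a latency penalty from a predicted strength. The post does not say how Max's tradeoff is actually configured, so the utility formula, the `latency_penalty` parameter, and all names and numbers below are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    predicted_strength: float  # hypothetical per-prompt quality estimate, higher is better
    expected_latency_s: float  # hypothetical expected latency in seconds


def route(candidates, latency_penalty):
    """Pick the candidate with the highest strength minus weighted latency.

    latency_penalty is an assumed knob: strength points given up per second saved.
    """
    return max(
        candidates,
        key=lambda c: c.predicted_strength - latency_penalty * c.expected_latency_s,
    )


# Hypothetical candidates, not real leaderboard entries.
candidates = [
    Candidate("strong-slow", 1300.0, 40.0),
    Candidate("fast-good", 1270.0, 18.0),
]

# A small penalty emphasizes strength; a large one emphasizes speed.
strength_heavy = route(candidates, latency_penalty=0.5).name
speed_heavy = route(candidates, latency_penalty=2.0).name
```

With the small penalty the router keeps choosing the strongest model, mirroring the Single-Image Edit behavior described above. With the larger penalty it switches to the faster model, mirroring the speed-weighted Multi-Image Edit configuration.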

With these expanded capabilities, Max can be used for more diverse tasks. Whether you want to generate a graphic, interpret a chart, or make a live website, Max can handle it. Go give the new and improved Max a try!
