{"slug": "hermes-agent-adds-virtual-models-for-multi-model-ai-routing", "title": "Hermes Agent adds virtual models for multi-model AI routing", "summary": "Nous Research released Mixture of Agents presets as virtual models in Hermes Agent, allowing users to select multi-model workflows like any other model. The company claims its MoA presets outperform individual frontier models by 8-11% on an upcoming benchmark, but has not yet published the benchmark details. The feature integrates multi-model steps into the agent loop, positioning Hermes as a neutral layer for composing models from multiple providers.", "body_md": "[Nous Research (@NousResearch)](https://x.com/NousResearch) said Friday that [Hermes Agent](https://hermes-agent.nousresearch.com/docs/) now exposes Mixture of Agents presets as virtual models, making a multi-model workflow selectable like any other model inside the agent.\n\nThe move, announced [in a two-post thread on X](https://x.com/NousResearch/status/2070610321278988385), is a product-level attempt to turn a familiar research pattern into a default user interface: run several models first, feed their outputs to an aggregator, and let the aggregator produce the final answer and tool calls. Nous framed the release as a way to get capabilities beyond individual gated frontier models, claiming its MoA presets scored 8% higher than Opus 4.8 and 11% higher than GPT 5.5 on an upcoming HermesBench benchmark.\n\nThat benchmark is the unresolved part of the announcement. Nous said a full HermesBench leaderboard is forthcoming, but did not publish the leaderboard, task mix, evaluation method, sample size, or the exact MoA preset behind the comparison in the X thread. Until those details are public, the percentage gains are Nous' own claim, not an independently checkable result.\n\nWhat is verifiable today is the implementation shape. In the [Hermes Agent MoA documentation](https://hermes-agent.nousresearch.com/docs/user-guide/features/mixture-of-agents), Nous describes Mixture of Agents as a virtual model provider. Each named preset appears under the `moa`\n\nprovider, and the preset can be selected through the same model picker surfaces used by the CLI, gateway, terminal UI, dashboard, and desktop app. In the desktop app, the model dropdown shows an `MoA presets`\n\nsection, according to the docs.\n\nThe architecture matters because it keeps the multi-model step inside Hermes' agent loop rather than treating it as an external prompt hack. Nous says the aggregator is the acting model: it writes the assistant response and emits tool calls. Reference models run first and provide analysis for the aggregator to use. Hermes then treats the aggregator response as the real model response, executes any tool calls normally, and repeats the MoA process on the next model iteration after tool results are added.\n\nThat design is the commercial bet under the feature. Model access has become a distribution problem as much as a capability problem: the strongest models are not equally available to every builder, and even when they are available, they differ in cost, latency, tool-call behavior, and reliability. Nous is positioning Hermes Agent as a layer where users can compose models from multiple providers without rewriting their workflows around each vendor's interface.\n\nHermes Agent already leans into that neutral control-plane role. The [project's GitHub README](https://github.com/NousResearch/hermes-agent) says Hermes can use Nous Portal, OpenRouter, NovitaAI, NVIDIA NIM, Hugging Face, OpenAI, or a user's own endpoint, with model switching handled through `hermes model`\n\n. The same README describes Hermes as a self-improving agent with memory, skill creation, scheduled automations, and messaging surfaces across Telegram, Discord, Slack, WhatsApp, Signal, and CLI. GitHub listed the repository at 204,000 stars and 36,500 forks when checked Friday.\n\nThe MoA feature extends that thesis from model choice to model composition. A user can define presets in `config.yaml`\n\n, through the dashboard, through desktop settings, or with `hermes moa configure`\n\n. The docs show a preset format that separates `reference_models`\n\nfrom an `aggregator`\n\n, including provider and model names for each. The key product detail is that the preset then shows up as a normal model name, not as a separate workflow users have to remember to run.\n\nThat simplicity cuts both ways. A virtual model can make ensemble reasoning feel like a single model call, but the underlying cost and latency still depend on how many reference models run, which providers they hit, and how often the agent loops after tool calls. Nous' docs acknowledge part of that tradeoff by saying reference models receive only conversation text, not the Hermes system prompt or tool-call transcript, so those calls stay cheaper and avoid strict-provider rejections.\n\nFor Nous Research, the timing is also pointed. Hermes Agent's public site now describes the software as open source under the MIT License and says paid Nous Portal tiers include credits for Hermes Agent, access to more than 300 models, and built-in tool use. MoA presets give Nous another reason for users to route work through Hermes and Portal: the value proposition is no longer just access to models, but orchestration across them.\n\nThe stronger claim - that a configured Hermes MoA preset can outperform individual frontier systems on HermesBench - still rests on a leaderboard Nous has not released. The product shift is available to inspect now. The benchmark case remains a company assertion until Nous publishes enough detail for outsiders to reproduce or challenge it.", "url": "https://wpnews.pro/news/hermes-agent-adds-virtual-models-for-multi-model-ai-routing", "canonical_source": "https://runtimewire.com/article/hermes-agent-adds-virtual-models-for-multi-model-ai-routing", "published_at": "2026-06-26 20:52:42+00:00", "updated_at": "2026-06-26 21:09:12.191318+00:00", "lang": "en", "topics": ["ai-agents", "large-language-models", "ai-tools", "ai-infrastructure", "ai-research"], "entities": ["Nous Research", "Hermes Agent", "OpenRouter", "NVIDIA NIM", "Hugging Face", "OpenAI", "GitHub"], "alternates": {"html": "https://wpnews.pro/news/hermes-agent-adds-virtual-models-for-multi-model-ai-routing", "markdown": "https://wpnews.pro/news/hermes-agent-adds-virtual-models-for-multi-model-ai-routing.md", "text": "https://wpnews.pro/news/hermes-agent-adds-virtual-models-for-multi-model-ai-routing.txt", "jsonld": "https://wpnews.pro/news/hermes-agent-adds-virtual-models-for-multi-model-ai-routing.jsonld"}}