cd /news/artificial-intelligence/amd-s-lemonade-ai-server-now-much-mo… · home topics artificial-intelligence article
[ARTICLE · art-31673] src=phoronix.com ↗ pub= topic=artificial-intelligence verified=true sentiment=↑ positive

AMD's Lemonade AI Server Now Much More Useful With MCP Server Integration

AMD-backed Lemonade AI server released v10.8 with Model Context Protocol (MCP) server integration, enabling MCP-compatible clients like GitHub Copilot and Claude Desktop to use local AI models as tools for chat, transcription, image generation, and multi-modal tasks. The update also adds ROCm support for Radeon 860M, live model management, Moonshine speech-to-text, and experimental NVIDIA GB10 ARM64 support.

read2 min views1 publishedJun 17, 2026

The open-source Lemonade AI server for "100% free and private" AI usage across Windows and Linux in leveraging AMD Ryzen AI NPUs, Radeon GPUs, and x86_64 CPUs, is now much more powerful with today's v10.8 release.

The AMD-backed Lemonade local AI server continues rapidly advancing and with today's v10.8 release it adds Model Context Protocol (MCP) server integration. With Lemonade's MCP server support, any MCP-compatible client can interact with the Lemonade server in treating your locally-running models as tools. GitHub Copilot, Claude Desktop, Cursor, and other MCP-supportive clients can now interact with Lemonade via MCP with capabilities like chat, audio transcription, image generation, and one-shot multi-modal handling with Lemonade Omni.

All the new MCP server capabilities are outlined via the

More details in

Lemonade 10.8 also now enables ROCm for the GFX1152 / Radeon 860M graphics, adds support for live model management, Moonshine speech-to-text, and even experimental NVIDIA GB10 ARM64 support using the Llama.cpp CUDA back-end.

Downloads and more details on Lemonade 10.8 via

The AMD-backed Lemonade local AI server continues rapidly advancing and with today's v10.8 release it adds Model Context Protocol (MCP) server integration. With Lemonade's MCP server support, any MCP-compatible client can interact with the Lemonade server in treating your locally-running models as tools. GitHub Copilot, Claude Desktop, Cursor, and other MCP-supportive clients can now interact with Lemonade via MCP with capabilities like chat, audio transcription, image generation, and one-shot multi-modal handling with Lemonade Omni.

All the new MCP server capabilities are outlined via the

new documentationin Lemonade 10.8."Why it matters. Agents running in a frontier model can now route the privacy-sensitive or high-volume parts of a task to local Lemonade models without leaving the conversation. Bulk classification, on-device transcription and image generation become free, private and offline, orchestrated by whatever host model the user prefers."

More details in

this pull requestadding the MCP server support.Lemonade 10.8 also now enables ROCm for the GFX1152 / Radeon 860M graphics, adds support for live model management, Moonshine speech-to-text, and even experimental NVIDIA GB10 ARM64 support using the Llama.cpp CUDA back-end.

Downloads and more details on Lemonade 10.8 via

── more in #artificial-intelligence 4 stories · sorted by recency
── more on @amd 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/amd-s-lemonade-ai-se…] indexed:0 read:2min 2026-06-17 ·