{"slug": "i-built-a-desktop-chat-app-for-running-local-llms-offline", "title": "I Built a Desktop Chat App for Running Local LLMs Offline", "summary": "Openbench AI is a desktop chat application that runs local large language models (LLMs) offline via Ollama, with no cloud services or complex setup. The app lets users run multiple models simultaneously for direct comparison, supports temporary chats, and enables in-app model installation, all while being open source and available as a pre-built release.", "body_md": "A few months ago I got tired of bouncing between ChatGPT, Claude, and a dozen other AI chat UIs every time I wanted to test a different model. It felt like productivity, if productivity involved tab overload and constant context switching.\nI also wanted to run everything locally without sending data off to someone else’s server just to ask a question about JSON formatting.\nSo I built Openbench AI.\nOpenbench AI is a desktop chat app that connects to Ollama and lets you talk to local LLMs without the usual ritual of Docker, Python environments, or “why is this port not working again” debugging sessions.\nYou install it. You open it. You chat.\nThat’s it.\nYou can run multiple models at once and watch them respond in real time to the same prompt.\nInstead of guessing which model is best, you can just compare them directly under identical conditions like a mildly scientific experiment that occasionally exposes how inconsistent models can be.\nFull math rendering via KaTeX.\nSo when a model writes equations or structured explanations, it actually looks like something a human might willingly read.\nTemporary chats that disappear when you close the app.\nNo storage. No accounts. No memory overhead.\nJust quick experiments without commitment.\nYou can install models directly inside the app instead of juggling terminal commands and hoping you typed everything correctly the first time.\nIf I rebuilt it, I’d add OpenAI-compatible API support from day one.\nRight now it only works with Ollama, though the architecture already supports additional providers. I just haven’t wired them in yet because priorities are a fictional concept.\nThe project is open source, and release builds are available if you want to try it without compiling anything yourself.\nSomewhere between building multi-model streaming and debugging Rust event bridges, I learned that the world did not strictly need another chat UI.\nSo I made one anyway.\nFor what it’s worth, it behaves better than most tools that have significantly more funding and optimism.\nAnd yes, it works just fine for a gay developer trying to wrangle multiple AI models without losing their mind to tool fragmentation.\nWhich is probably more honesty than most software documentation is legally allowed to contain.", "url": "https://wpnews.pro/news/i-built-a-desktop-chat-app-for-running-local-llms-offline", "canonical_source": "https://dev.to/theoslater/i-built-a-desktop-chat-app-for-running-local-llms-offline-6im", "published_at": "2026-05-20 15:16:48+00:00", "updated_at": "2026-05-20 15:32:57.150371+00:00", "lang": "en", "topics": ["large-language-models", "developer-tools", "open-source", "products"], "entities": ["Openbench AI", "Ollama", "ChatGPT", "Claude", "KaTeX"], "alternates": {"html": "https://wpnews.pro/news/i-built-a-desktop-chat-app-for-running-local-llms-offline", "markdown": "https://wpnews.pro/news/i-built-a-desktop-chat-app-for-running-local-llms-offline.md", "text": "https://wpnews.pro/news/i-built-a-desktop-chat-app-for-running-local-llms-offline.txt", "jsonld": "https://wpnews.pro/news/i-built-a-desktop-chat-app-for-running-local-llms-offline.jsonld"}}