One EXE. No Python. No Docker. 120 Windows automation tools written in Go.

wpnews.pro

cd /news/artificial-intelligence/one-exe-no-python-no-docker-120-wind… · home › topics › artificial-intelligence › article

[ARTICLE · art-45674] src=dev.to ↗ pub=2026-06-30T22:30Z topic=artificial-intelligence verified=true sentiment=↑ positive

One EXE. No Python. No Docker. 120 Windows automation tools written in Go.

A developer built a Windows computer-use MCP server in pure Go, delivering a single 27 MB executable with 120 automation tools for local LLMs. The project, spanning over 14,000 lines of Go, implements vision, memory, and a training pipeline without dependencies on OpenCV or COM libraries. Created through pair-programming with multiple AI models, the tool enables LLMs to control a Windows desktop via mouse, keyboard, and screen interaction.

read3 min views1 publishedJun 30, 2026

I built a Windows computer-use MCP server in pure Go. One EXE. No Python. No Docker.

It's a single 27 MB executable that gives local LLMs (Claude, Gemini, Cursor, Kiro, OpenCode, Ollama, you name it) the ability to actually use a Windows desktop. Think of it as giving an LLM a mouse, keyboard, eyes, and long-term memory.

(Screenshots and demo clips coming soon — wanted to get the post up first.)

Over 14,000 lines of Go, zero dependencies on OpenCV, go-ole, or any COM binding library. Almost every subsystem was implemented from scratch in pure Go instead of wrapping existing libraries.

I'm not a professional software engineer. I built this by pair-programming with multiple AI models across hundreds of iterations.

And I didn't spend a cent on API tokens. Gemini CLI (free tier), Claude Code (trial credits), Ollama (local, free), GitHub Copilot, OpenCode's Big Pickle (200K ctx, free). The ollama launch claude

trick was a workhorse — point Claude Code at a Nemotron or MiniMax3 locally and get agentic scaffolding on budget hardware.

This started as "I want my AI to click a button." It somehow turned into a Windows automation framework with vision, memory, and a training pipeline.

But the real reason? A friend who's disabled and uses Narrator as their primary computer interface. They asked for a month to test once I was ready to go public. After countless trials and Python's "works on my machine" nonsense, I wanted something that actually ships.

120 MCP tools covering the full desktop stack:

find_image

/find_all_images

with triple cascade: template matching → ONNX YOLO → OCRocr_languages

, middle mouse, horizontal scroll, fullscreen detectionBattle-tested with: Claude Desktop, Claude Code, Kiro, Cursor, Windsurf, Gemini CLI, OpenCode, Ollama, Antigravity IDE, Cline, Android Studio, Zed, Obsidian, and more.

syscall.SyscallN(vtblMethod(obj, N), ...) with indices hand-verified against Windows SDK headers.If you're building AI agents, this gives them hands.

If you're building desktop automation, this gives you 120 reusable tools.

If you're learning Windows internals, it shows how raw WinRT COM, OCR, ONNX, and UI automation fit together in a real project.

If you're just curious how far one person can get with modern AI tools, this is my answer.

Yes, it can control your computer. So can AutoHotkey, PowerShell, Selenium, and every RPA tool. This is local-first, every action can be logged, privacy controls toggle at runtime. Not spyware. Not a remote admin tool. Just an automation engine for your own machine.

Especially from people into Go, MCP, local AI, computer vision, automation, or Windows internals. Open issues, suggest features, steal patterns — the repo has templates and a security policy now. Tell me what I broke, what to build next, or that I'm insane for hand-writing COM vtables in Go.

AI didn't build this project. AI became my pair programmer.

The architecture, direction, debugging, testing, and endless "why doesn't Windows do what the docs say?" moments were still mine.

It convinced me that one curious computer technician, persistence, and today's AI tools can build things I wouldn't have been able to create on my own just a few years ago.

Links

GitHub: https://github.com/coff33ninja/go-mcp-computer-use Docs: docs/reference/tools.md

, docs/security.md

, docs/mcp-client-configs.md

source & further reading

dev.to — original article LongCat-2.0 & Agentic AI: Reshaping India's Tech by 2026 Turn Phone Numbers into Lead Signals FOV in FPS Games: The Math Behind Field of View Settings

~/api · this article 200

$curl api.wpnews.pro/v1/news/one-exe-no-python-no-doc…

Read original on dev.to → dev.to/coff33ninja/one-exe-no-python-no-docker-1…

mentioned entities

Claude

Gemini

Cursor

Kiro

OpenCode

Ollama

GitHub

Windows

metadata

slugone-exe-no-python-no-docker-120-windows-automation-tools-written-in-go

topic#artificial-intelligence

secondary4 topics

sentimentpositive

canonicaldev.to

navigation

← prevI Built an LLM Gateway That Exte…

next →Optimizing Speed and Accuracy in…

── more in #artificial-intelligence 4 stories · sorted by recency

github.com · 30 Jun · #artificial-intelligence

Show HN: Mimir – local-first encrypted memory for AI agents (single Rust binary)

dev.to · 30 Jun · #artificial-intelligence

I Built an LLM Gateway That Extends Claude Pro/Max Users with Azure AI Foundry, Amazon Bedrock, Local Models

dev.to · 30 Jun · #artificial-intelligence

Claude Sonnet 5 Just Made Running Agents Cheap — What Builders Actually Need to Know

dev.to · 30 Jun · #artificial-intelligence

OpenTelemetry Tells You What Your Agent Did. Not Whether It Was OK.

── more on @claude 3 stories trending now

wpnews · 30 May · #ai-tools

I was wasting 10 minutes every Claude session. So I built a fix.

wpnews · 27 May · #machine-learning

hunting for headroom on modded-nanoGPT (WR #82)

wpnews · 2 Jun · #ai-products

Microsoft launches Discovery platform for scientific R&D with Ginkgo Bioworks partnership

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required