Show HN: Running a vision model on every screenshot on-device

wpnews.pro

cd /news/computer-vision/show-hn-running-a-vision-model-on-ev… · home › topics › computer-vision › article

[ARTICLE · art-43401] src=github.com ↗ pub=2026-06-29T12:39Z topic=computer-vision verified=true sentiment=↑ positive

Show HN: Running a vision model on every screenshot on-device

Developer Ayush has released Screenmind, an open-source, privacy-first alternative to Microsoft Recall that runs vision models locally on device. The tool captures and indexes screenshots, enabling search, chat with screen history, and automation, using a three-tier perceptual hash cache to reduce inference by up to 40%. It supports Windows, Mac, and Linux, with rough edges on Mac and multi-monitor support still in development.

read2 min views1 publishedJun 29, 2026

hi author here, Screenmind is privacy first Microsoft recall alternative . It runs on gemma 4 which is one of the fewer models supporting vision audio and reasoning all 3, so your data never leaves you machine.

With screenmind you can keep a track of your timeline , how much time you spent on what..search any screenshot with any text on it.. and the coolest thing, you can chat with your screen history, like what did alex texted me on discord or did i received any mail from Microsoft, if it was on your screen , you can prompt it in the cha. and also you can make automations on top of it, like send me my whole day report on slack(it has integrations )..you can also write automation either though plain English for not so coders or use the python for devs who want to deep dive, and you can save voice memos(with a screenshot) with just a hotkey, and get you meeting transcribed and summarised(auto detects meeting)

the hardest part which i faced was keep running screenmind as a background service it would not have been not hard if chat feature didn't existed, as running local model requires compute ..and keep analyzing screenshots continuously will keep all the resouces hogged up for that i came up with a perceptual has cache .. the three tier cache system reduces inference upto 40% for an average user(which is me)..and to reduce the inference time more i came up with three modes..fast balanced and accurate..where the tradeoff is between time and accuracy

for now i use it daily on my 4gb gtx 1650 with fast mode, works pretty fine also it would be much faster on high end machine , it also has a mcp server so you can just ask claude desktop/cursor about the bug you saw in morning.. supports windows/mac/Linux

being upfront about rough edges , it is not extensively tested on mac and installation has some friction , for which i m working on one click installer thing

(reposting- i put up an earlier version a few months back, comments got flagged cuz of new account so couldn't reply to any )

repo:github.com/ayushh0110/ScreenMind

curious about anyone have idea for how to approach multi monitor support

Comments URL: [https://news.ycombinator.com/item?id=48718498](https://news.ycombinator.com/item?id=48718498)

Points: 3

source & further reading

github.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/show-hn-running-a-vision…

Read original on github.com → github.com/ayushh0110/ScreenMind/blob/main/READM…

mentioned entities

Screenmind

Ayush

Microsoft

Gemma 4

Claude

Cursor

Slack

Discord

metadata

slugshow-hn-running-a-vision-model-on-every-screenshot-on-device

topic#computer-vision

secondary4 topics

sentimentpositive

canonicalgithub.com

navigation

← prevEurope’s Path to Defense Resilie…

next →Gaokao Results Trigger Wave of C…

── more in #computer-vision 4 stories · sorted by recency

kdnuggets.com · 29 Jun · #computer-vision

5 AI Coding Subscription Plans That Give Developers the Best Value

fastcompany.com · 29 Jun · #computer-vision

From static storefronts to decision engines

zdnet.com · 29 Jun · #computer-vision

Want a big tech job? Startups may be your best shot now - here's why

lennysnewsletter.com · 29 Jun · #computer-vision

No Figma. No Jira. No docs. How Gusto built a new product line with Claude Code | Eddie Kim (CTO)

── more on @screenmind 3 stories trending now

wpnews · 28 May · #ai-startups

[AINews] Cognition raises $1B in $26B Series D

wpnews · 5 Jun · #ai-agents

Miasma Worm Targets AI Coding Agents via GitHub Repos

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required