OpenAI's Whisper model runs accurate speech-to-text on a laptop GPU, and the C++ port runs it without Python or a cloud account. That capability sits unused while teams pay per-minute fees to upload their calls to a transcription service. Off Grid AI Desktop is a free, open-source app that records and transcribes your meetings directly on your Mac or PC.
Free, open-source, runs offline. No account, no API key, no telemetry.
You want a searchable record of what was said, not a subscription to a notetaker bot. Here is the difference local transcription makes.
The audio never leaves your machine, so a confidential call stays confidential. There is no per-minute meter, so a three-hour planning session costs the same as a five-minute standup, which is nothing. The transcript and summary save to your disk, so you own the record instead of renting access to it.
Whisper comes in several sizes. The bigger ones transcribe better and want more memory.
| Tier | macOS | Windows | RAM / VRAM | Free disk |
|---|---|---|---|---|
| Minimum | Apple Silicon M1 | Any modern CPU or iGPU | 8 GB | 10 GB |
| Recommended | M2 / M3 / M4 | NVIDIA RTX (6 GB+ VRAM) | 16 GB+ | 20 GB |
On the minimum tier, a smaller Whisper model transcribes a one-hour call in a few minutes after it ends. The recommended tier runs the larger, more accurate models and finishes faster. CPU transcription works everywhere; a GPU just speeds it up.
The recorder captures three streams at once: the screen video, the system audio coming out of your speakers, and your microphone. That covers both sides of a Google Meet or Zoom call, the remote participants on system audio and you on the mic.
When you stop, the bundled whisper.cpp
engine transcribes the captured audio locally. There is no upload step. The transcription is the same model the cloud services use, running on your own hardware.
After the transcript is ready, the local LLM reads it and generates a title, a summary, and a list of the people mentioned. You get a usable meeting note without rewatching the recording or paying a notetaker.
Whisper ships as quantized weights, which stores the model at lower precision so it fits in consumer RAM without losing meaningful accuracy. That is the change that moved good transcription off the cloud and onto a laptop.
On macOS, transcription runs through Metal against Apple Silicon's unified memory. On Windows, you get CUDA on NVIDIA cards or a CPU path when there is no GPU. Either way the math runs locally, so the longer a call runs, the more you save versus a per-minute service.
Audio quality drives transcription quality more than model size does. Use a headset or a decent mic so your own track is clean, and the result improves immediately.
Pick the model to match the job. A smaller Whisper model is fine for a quick internal sync where you just need the gist. Reach for a larger model when accuracy matters, like a client call you will quote from later. Recording system audio captures the remote side at the source, which beats a microphone picking it up off your speakers.
A cloud notetaker uploads your full meeting audio and video to a server you do not control. The recording, the transcript, and the summary all live there. Off Grid AI Desktop keeps every part of that on your machine, because there is no server in the path.
It is AGPL-3.0 licensed, so the code is auditable. There is no account and no telemetry. The recorder requires an explicit start and stop, and a visible recording indicator stays on the whole time, so nothing records silently in the background.
git clone https://github.com/off-grid-ai/desktop
cd desktop
npm install
npm run dev
Yes. The app is free and open-source under AGPL-3.0. There are no per-minute charges and no subscription.
Yes. Recording and transcription both run with no network connection.
Yes. It records the screen, system audio, and mic, so it captures both sides of any call in your browser or a desktop client.
It runs Whisper, the same model behind many cloud transcription tools. Accuracy depends on the model size you pick and your audio quality.
8 GB runs a smaller Whisper model. 16 GB or more is comfortable for the larger, more accurate ones.
Yes. The audio, transcript, and summary stay on your machine. Nothing uploads.
Transcribe your calls on hardware you already own. GitHub →