{"slug": "vox-dictum-on-device-transcription-with-speaker-diarisation-and-ai-summaries", "title": "Vox Dictum, on-device transcription with speaker diarisation and AI summaries", "summary": "Vox Dictum, a new macOS app from Cobalt InFX, offers on-device transcription, speaker diarisation, and AI-generated summaries for audio and video recordings. The software processes all data locally on the user's Mac, ensuring no recordings, transcripts, or personal information are transmitted to any server or third party. The app, available on the Mac App Store for Apple Silicon devices running macOS 14.6 or later, provides a privacy-focused alternative to cloud-based transcription services.", "body_md": "We build intelligent, privacy-first software for professionals who demand precision and control. Every product runs on-device — your data stays yours.\n\nPrivate, on-device transcription for macOS. Import any recording, label speakers, and generate intelligent summaries — all processed locally on your Mac.\n\nDownload on the Mac App Store\n\nmacOS 14.6+ · Apple Silicon (M1 or later)\n\nImport audio or video. AI-powered speech recognition transcribes with high accuracy across 60+ languages. AI summaries available in English and major languages.\n\nAutomatically detect and label speakers. Rename a speaker once — all phrases in that file update. With Pro+, the same voice is recognised across multiple recordings.\n\nGenerate structured summaries tailored to meetings, interviews, podcasts, and more — with key decisions, action items, and speaker contributions. On-device AI — no cloud, no data sharing.\n\nBuilt-in speech enhancement, background noise removal, and silence removal. Better input, better output.\n\nExport as TXT, Markdown, HTML, or SRT subtitles — all with speaker names. Ready for your workflow.\n\nZero data collection. Zero analytics. Zero cloud processing. Your recordings and transcripts never leave your Mac.\n\n| Vox Dictum | MacWhisper | Otter.ai | Dragon | |\n|---|---|---|---|---|\n| On-device processing | ✓ | ✓ | ✗ | ✓ |\n| Zero data collection | ✓ | ✓ | ✗ | ✗ |\n| AI Summary | ✓ On-device | ✗ | Cloud | ✗ |\n| Speaker recognition | ✓ On-device | ✗ | Cloud | ✓ |\n| Overlap detection | ✓ | ✗ | ✗ | ✗ |\n| Audio enhancement | ✓ | ✗ | ✗ | ✗ |\n| Free tier | Unlimited | ✗ | 300 min/mo | ✗ |\n| Price | From £7.99/mo | €59 one-time | $16.99/mo | $699 |\n\n```\nPopular\nPro\n£7.99 /month\nEverything in Free, plus:\n\nAdvanced transcription models\nAI Powered Summary\nOverlap resolution\nCustom vocabulary corrections\nSpeaker reallocation\nTranscript phrase splitting\nBulk transcript export\n🔒 Cobalt InFX products collect no data. Recordings, transcripts, and summaries are processed entirely on your device and are never transmitted to any server.\n```\n\nVox Dictum processes audio and video recordings to produce text transcripts, speaker labels, and AI-generated summaries. All processing is performed locally on your Mac using on-device machine learning models. No audio, text, or metadata is sent to Cobalt InFX, Apple, or any third party during processing.\n\nNothing. Vox Dictum does not collect personal data, usage analytics, crash reports, telemetry, or any other information from your device. The app contains no analytics frameworks, no tracking pixels, and no third-party SDKs that collect data.\n\nThe only network activity in Vox Dictum is: (1) downloading AI models on first use (~2 GB, one-time), (2) downloading additional transcription models when you select a new model size, and (3) Apple verifying your subscription status. No recording data, transcript content, or user-generated content is transmitted over the network at any time.\n\nSpeaker voice matching (Pro+ tier) uses on-device neural network inference to compare voice characteristics within a single processing session. Voice embeddings are computed transiently in memory and discarded when the processing job completes. No biometric data is stored persistently. No voice profiles are created or retained between sessions.\n\nSubscriptions are managed entirely by Apple through the App Store. Cobalt InFX does not process payments, store credit card information, or have access to your Apple ID credentials.\n\nYour transcripts, speaker names, vocabulary corrections, and summaries are stored locally in your Mac's Application Support directory within the app's sandbox. This data is included in Time Machine backups and is deleted when you uninstall the app. Cobalt InFX has no access to this data.\n\nCobalt InFX products are not directed at children under 13. We do not knowingly collect any information from children.\n\nIf we update this policy, we will post the revised version on this page with an updated effective date.\n\nEffective date: April 2026\n\nContact: support@cobaltinfx.com\n\nCheck the built-in Help section in Vox Dictum: Settings → Help. Most common questions are answered there.\n\nModel downloads require a stable internet connection (~2–7 GB depending on features used). Transcription with advanced models may take longer than the recording duration — this is normal for on-device processing.\n\nClick the button below to open your email client. Describe your issue and attach screenshots if relevant.\n\nPlease include:\n\nsupport@cobaltinfx.com", "url": "https://wpnews.pro/news/vox-dictum-on-device-transcription-with-speaker-diarisation-and-ai-summaries", "canonical_source": "https://cobaltinfx.com/", "published_at": "2026-05-31 13:25:04+00:00", "updated_at": "2026-05-31 13:54:13.600862+00:00", "lang": "en", "topics": ["ai-products", "ai-tools", "natural-language-processing", "artificial-intelligence", "ai-startups"], "entities": ["Vox Dictum", "MacWhisper", "Otter.ai", "Dragon", "Mac App Store", "macOS", "Apple Silicon"], "alternates": {"html": "https://wpnews.pro/news/vox-dictum-on-device-transcription-with-speaker-diarisation-and-ai-summaries", "markdown": "https://wpnews.pro/news/vox-dictum-on-device-transcription-with-speaker-diarisation-and-ai-summaries.md", "text": "https://wpnews.pro/news/vox-dictum-on-device-transcription-with-speaker-diarisation-and-ai-summaries.txt", "jsonld": "https://wpnews.pro/news/vox-dictum-on-device-transcription-with-speaker-diarisation-and-ai-summaries.jsonld"}}