Offline Video Search, Scene Segmentation, Speech Transcription (Whisper), Speaker Diarization, and SQLite + Qdrant Semantic Search on Windows 11
(Note: The optional Ask GoodQ voice agent is a hosted extension using ElevenLabs APIs. The core GoodQ4All memory system itself is 100% local and offline.)
GoodQ4All is a 100% private, offline alternative to cloud-based media intelligence services. It ingests video, audio, and text files into queryable, structured scene-level memories, persisting the knowledge graph and vector representations locally on your computer.
Following a strict "proof-backed" system doctrine, GoodQ4All documents every perception step, tracks evidence manifests, and logs a comprehensive audit trail so that every memory claim can be verified.
- Get This Level of Local Control (Unified Operator UI): Click the preview above to watch the high-fidelity onboarding video.
- From Video Quality as Low as This (Raw Media Inputs): Apollo 11 Moon Walk (nasa_descent.gif), Saturn V Launch (nasa_launch.gif)
- Using This All-in-One Installer (Unified Windows Installer): 🚀 Download GoodQ4All Setup v2.4.0.exe

[!IMPORTANT]
- System Requirement: Windows 11 only. GoodQ4All is built for Windows-first local execution. It requires at least 25 GB of free space to store local database structures, models, and cache files.
- SmartScreen Workaround: Because the setup installer is currently self-signed, Windows SmartScreen may show an "Unknown Publisher" dialog. Click More info and select Run anyway to proceed.
- Integrity Checksum: Verify your download authenticity by running the following command in PowerShell and comparing the result with the expected SHA256 hash. Refer to the GitHub Releases page for the latest signed executable checksum.

Get-FileHash GoodQ4All_Setup_2.4.0.exe
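The same integrity check can be scripted. The following is a minimal sketch, assuming the expected hash has been copied by hand from the GitHub Releases page; `sha256_of` and `matches` are hypothetical helper names and are not part of GoodQ4All.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the lowercase hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Compare case-insensitively, since Get-FileHash prints uppercase hex."""
    expected = expected_hex.strip().lower()
    return hmac.compare_digest(sha256_of(path).encode(), expected.encode())
```

Calling `matches("GoodQ4All_Setup_2.4.0.exe", published_hash)` returns `True` only when the local file hashes to the published value.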
Machine memory should earn every claim it makes. GoodQ4All generates step-by-step logs (step_runs.jsonl), scene manifests, and intermediate features for every ingested file.
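A JSON Lines step log holds one JSON object per line and is only ever appended to, which makes it straightforward to audit afterwards. The sketch below illustrates that pattern. The field names (`ts`, `step`, `status`) are assumptions for illustration, since the actual schema of step_runs.jsonl is not described here.

```python
import json
import time
from pathlib import Path


def append_step(log_path, step, status, **details):
    """Append one step record as a single JSON line (hypothetical schema)."""
    record = {"ts": time.time(), "step": step, "status": status, **details}
    with Path(log_path).open("a", encoding="utf-8") as handle:
        handle.write(json.dumps(record, sort_keys=True) + "\n")
    return record


def read_steps(log_path):
    """Yield every record in the log, skipping blank lines."""
    with Path(log_path).open("r", encoding="utf-8") as handle:
        for line in handle:
            if line.strip():
                yield json.loads(line)


def failed_steps(log_path):
    """Return the names of steps whose status is not 'ok'."""
    return [r["step"] for r in read_steps(log_path) if r["status"] != "ok"]
```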
- Single-Use Confirmation Tokens: Ingestion submission routes are protected by a server-generated token handshake with single-use nonce validation, preventing unauthenticated or out-of-bounds execution.
- Epistemic Verification: Ingestion is tracked using verifiable manifests and SQLite-backed relational schemas.
- No Silent Failures: The Control Agent and Watchdog processes bubble errors directly to the operator consoles, providing full visibility into the execution stack.
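A single-use token handshake can be sketched as follows. This is an illustration of the general technique under stated assumptions, not GoodQ4All's actual implementation: the class name `ConfirmationTokens`, the token format, and the 60-second lifetime are all hypothetical.

```python
import hashlib
import hmac
import secrets
import time


class ConfirmationTokens:
    """Issue tokens that can be redeemed exactly once before they expire."""

    def __init__(self, ttl_seconds=60.0, clock=time.monotonic):
        self._secret = secrets.token_bytes(32)  # per-process signing key
        self._pending = {}  # nonce -> expiry timestamp
        self._ttl = ttl_seconds
        self._clock = clock

    def _sign(self, nonce):
        return hmac.new(self._secret, nonce.encode(), hashlib.sha256).hexdigest()

    def issue(self):
        """Create a fresh nonce, remember it, and return 'nonce.signature'."""
        nonce = secrets.token_urlsafe(16)
        self._pending[nonce] = self._clock() + self._ttl
        return f"{nonce}.{self._sign(nonce)}"

    def redeem(self, token):
        """Return True only for a genuine, unexpired, not-yet-used token."""
        nonce, _, signature = token.partition(".")
        if not hmac.compare_digest(signature.encode(), self._sign(nonce).encode()):
            return False  # forged or corrupted token
        expiry = self._pending.pop(nonce, None)  # pop makes the nonce single-use
        return expiry is not None and self._clock() <= expiry
```

Because `redeem` removes the nonce on first use, replaying a captured token fails even though its signature is still valid.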
To run large-parameter local models safely on consumer hardware (e.g. RTX 4070 Ti SUPER 16GB) without Out-of-Memory (OOM) crashes, GoodQ4All implements strict VRAM and execution controls:
- Model Lifecycle Manager: A specialized context manager (lib/model_lifecycle.py) that audits free VRAM using PyTorch and nvidia-smi before loading models, dynamically evicting idle networks from GPU memory.
- Local Agent Stack (MiniAgentClient): Gated LLM reasoning and local tool execution through zero-dependency policy enforcement middleware, with schemas, policies, and contracts loaded dynamically from the version-controlled agents/stack/ directory.
- Endpoint Fallback Orchestration: Automatically falls back from the primary local vLLM server (prefer_speed, running Qwen2.5) to a local Ollama service (prefer_quality, running Phi-4) or a CPU-safe model variant when VRAM thresholds are breached.
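The VRAM gate and the fallback order can be sketched together. This is a simplified illustration, not the contents of lib/model_lifecycle.py: the threshold values, the `cpu_safe` label, and the helper names `choose_endpoint` and `gpu_slot` are assumptions, and the VRAM probe is passed in as a callable so the sketch runs without a GPU.

```python
from contextlib import contextmanager

# Hypothetical table: (endpoint label, minimum free VRAM in MiB), in preference order.
ENDPOINTS = [
    ("prefer_speed", 12_000),   # primary local vLLM server
    ("prefer_quality", 8_000),  # local Ollama service
    ("cpu_safe", 0),            # CPU-safe variant, always fits
]


def choose_endpoint(free_vram_mib, endpoints=ENDPOINTS):
    """Return the first endpoint whose VRAM floor is satisfied."""
    for name, floor in endpoints:
        if free_vram_mib >= floor:
            return name
    raise RuntimeError("no endpoint fits; the table should end with a 0 MiB entry")


@contextmanager
def gpu_slot(required_mib, probe_free_mib, evict_idle):
    """Ensure enough free VRAM before the body runs, evicting idle models if needed.

    probe_free_mib: callable returning free VRAM in MiB, for example a wrapper
        around torch.cuda.mem_get_info() or nvidia-smi.
    evict_idle: callable that unloads one idle model and returns False when
        nothing is left to evict.
    """
    while probe_free_mib() < required_mib:
        if not evict_idle():
            raise MemoryError(f"need {required_mib} MiB of free VRAM")
    yield
```

With this table, `choose_endpoint(9_000)` selects `prefer_quality`, and a `with gpu_slot(...)` block raises `MemoryError` instead of letting a model load into insufficient memory.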
High-precision 32-bit floating point embeddings are persisted in Qdrant and FAISS. For rapid candidate filtering, GoodQ4All uses TurboQuant, an SQLite sidecar caching technology employing Lloyd-Max Polar Quantization and Johnson-Lindenstrauss residual projections.
- Performance: Achieves sub-millisecond candidate pre-filtering.
- Accuracy: Ranking accuracy is preserved by performing the final rank scoring on the uncompressed raw float32 vectors, so quantization error never enters the final scores.
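The two-stage idea (cheap pre-filter on compressed codes, exact rerank on full-precision vectors) can be shown in a few lines. Note the substitution: this sketch uses a plain uniform scalar quantizer as a stand-in and does not implement Lloyd-Max Polar Quantization or Johnson-Lindenstrauss residual projections. All function names are hypothetical.

```python
import math


def quantize(vec, levels=16):
    """Map each component in [-1, 1] to an integer code (uniform scalar quantizer)."""
    top = levels - 1
    return [min(top, max(0, round((x + 1.0) / 2.0 * top))) for x in vec]


def dequantize(codes, levels=16):
    """Map integer codes back to approximate values in [-1, 1]."""
    top = levels - 1
    return [c / top * 2.0 - 1.0 for c in codes]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def normalize(vec):
    norm = math.sqrt(dot(vec, vec)) or 1.0
    return [x / norm for x in vec]


def search(query, vectors, codes, shortlist=4, top_k=2):
    """Stage 1: approximate scores from codes. Stage 2: exact float rerank."""
    approx = [(dot(query, dequantize(c)), i) for i, c in enumerate(codes)]
    candidates = [i for _, i in sorted(approx, reverse=True)[:shortlist]]
    exact = [(dot(query, vectors[i]), i) for i in candidates]
    return [i for _, i in sorted(exact, reverse=True)[:top_k]]
```

The final ordering depends only on exact dot products, so quantization affects which candidates reach the shortlist but not how the shortlisted candidates are scored.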
Note
Hybrid Precision Caching Model:
GoodQ4All uses an additive sidecar vector cache architecture. High-precision 32-bit floating point (float32) embeddings remain the authoritative truth of the system, stored in Qdrant and FAISS. Performance-oriented query pre-filtering is handled via lightweight TurboQuant fields (Lloyd-Max Polar Quantization + Johnson–Lindenstrauss residual corrections) stored in SQLite. Because the sidecar only adds derived data, it can be discarded and rebuilt without touching the float32 vectors, which ensures zero data loss, guarantees rollback capability, and cuts memory usage.
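An additive sidecar of this kind can be sketched with the standard `sqlite3` module. The table name `vector_sidecar`, its columns, and the helper names are hypothetical. The point of the sketch is only that the cache lives in its own table and can be dropped without affecting the authoritative vectors.

```python
import sqlite3


def attach_sidecar(conn):
    """Create the additive cache table; the authoritative store is untouched."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS vector_sidecar ("
        "vector_id TEXT PRIMARY KEY, codes BLOB NOT NULL)"
    )


def put_codes(conn, vector_id, codes):
    """Store small integer codes (0-255) as a compact byte string."""
    conn.execute(
        "INSERT OR REPLACE INTO vector_sidecar (vector_id, codes) VALUES (?, ?)",
        (vector_id, bytes(codes)),
    )


def get_codes(conn, vector_id):
    """Return the stored codes as a list of ints, or None if absent."""
    row = conn.execute(
        "SELECT codes FROM vector_sidecar WHERE vector_id = ?", (vector_id,)
    ).fetchone()
    return list(row[0]) if row else None


def rollback_sidecar(conn):
    """Drop the cache; it can be rebuilt later from the float32 vectors."""
    conn.execute("DROP TABLE IF EXISTS vector_sidecar")
```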
The pipeline dynamically adjusts its computational needs to match your system specs:
- BASELINE (CPU-safe): Fully operational, offline-ready execution on standard CPU hardware. Bypasses GPU requirements gracefully.
- GPU_ENHANCED: Activates local NVIDIA GPU (CUDA 12.1) and WSL2 accelerated audio processing paths for fast, high-volume ingestion.
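Tier selection of this kind usually reduces to a capability probe followed by a mapping. The sketch below assumes the probe is `nvidia-smi -L`, which lists the GPUs the driver can see. How GoodQ4All actually detects hardware is not documented here, and both function names are hypothetical.

```python
import shutil
import subprocess


def has_nvidia_gpu(run=subprocess.run, which=shutil.which):
    """Return True if nvidia-smi exists and lists at least one GPU."""
    if which("nvidia-smi") is None:
        return False
    try:
        result = run(["nvidia-smi", "-L"], capture_output=True, text=True, timeout=10)
    except (OSError, subprocess.SubprocessError):
        return False
    return result.returncode == 0 and "GPU" in result.stdout


def select_tier(gpu_available):
    """Map the probe result to one of the two tier names used above."""
    return "GPU_ENHANCED" if gpu_available else "BASELINE"
```

A caller would write `select_tier(has_nvidia_gpu())`. Any probe failure falls through to `BASELINE`, matching the CPU-safe default.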
GoodQ4All compiles the isolated Python environment, the Qdrant database, and perception libraries into a single executable wrapper:
- Download and run GoodQ4All_Setup_2.4.0.exe.
- Launch GoodQ4All from the desktop shortcut.
- Open the local Retro Memory Explorer dashboard at http://127.0.0.1:30000/ui/retro_console_v1/.
- Drag and drop video/audio files onto the yellow-dotted Upload Pad to begin automatic ingestion.
If you are developing, customizing the pipeline, or running from source:
Developer Source Setup Steps (Advanced)
GoodQ4All ships with two local operator console variants:
- Classic Operator Console (served at /ui/operator_console_v1/): Exposes the current scope strip, flight deck, proof/evidence status, recurrence reports, and video inventories.
- Retro Memory Explorer (v1.4.7) (served at /ui/retro_console_v1/): A premium cyber-CRT dashboard featuring a four-panel resizable/collapsible layout with floating restore tabs, an entity co-occurrence graph with dynamic zoom and flight transitions, an Inspector panel containing keyframe image/transcript views, and bidirectional timeline checklists.
- Guided demo: docs/guides/DEMO.md
- First run: docs/guides/FIRST_RUN.md
- Install: docs/guides/install/INSTALL.md
- Quickstart: docs/guides/install/QUICKSTART.md
- Clean memory start: docs/guides/CLEAN_MEMORY_START.md
- Data Privacy: docs/guides/general/PRIVACY.md
- Architecture: docs/architecture/SYSTEM_ARCHITECTURE.md
- Memory Storage: docs/architecture/MEMORY_STORAGE.md
- Hybrid Caching: docs/architecture/TURBOQUANT_HYBRID_CACHING.md
- Current Agent State: docs/agent/CURRENT_STATE.md
MIT. See LICENSE.