Build a full knowledge graph locally with any video

wpnews.pro

Offline Video Search, Scene Segmentation, Speech Transcription (Whisper), Speaker Diarization, and SQLite + Qdrant Semantic Search on Windows 11

(Note: The optional Ask GoodQ voice agent is a hosted extension using ElevenLabs APIs. The core GoodQ4All memory system itself is 100% local and offline.)

GoodQ4All is a 100% private, offline alternative to cloud-based media intelligence services. It ingests video, audio, and text files into queryable, structured scene-level memories, persisting the knowledge graph and vector representations locally on your computer.

Following a strict "proof-backed" system doctrine, GoodQ4All documents every perception step, tracks evidence manifests, and logs a comprehensive audit trail so that every memory claim can be verified.

Get This Level of Local Control (Unified Operator UI):

Click the preview above to watch the high-fidelity onboarding video. - From Video Quality as Low as This: (Raw Media Inputs):

Apollo 11 Moon Walk (nasa_descent.gif)

Saturn V Launch (nasa_launch.gif) - Using This All-in-One Installer (Unified Windows Installer):🚀 Download GoodQ4All Setup v2.4.0.exe[!IMPORTANT]

System Requirement: Windows 11 only. GoodQ4All is built for Windows-first local execution. It requires at least25 GB of free space to store local database structures, models, and cache files.SmartScreen Workaround: Since the setup installer is currently self-signed, Windows SmartScreen may show an "Unknown Publisher" dialog. ClickMore info and selectRun anyway to proceed.Integrity Checksum: Verify your download authenticity by running the following command in PowerShell:Expected SHA256 hash: Refer to the GitHub Releases page for the latest signed executable checksum.

Get-FileHash GoodQ4All_Setup_2.4.0.exe

Machine memory should earn every claim it makes. GoodQ4All generates step-by-step logs (step_runs.jsonl

), scene manifests, and intermediate features for every ingested file.

Single-Use Confirmation Tokens: Ingestion submission routes are protected by a server-generated token handshake with single-use nonce validation, preventing unauthenticated/out-of-bounds execution.Epistemic Verification: Ingestion is tracked using verifiable manifests and SQLite-backed relational schemas.** No Silent Failures**: The Control Agent and Watchdog processes bubble errors directly to the operator consoles, providing absolute visibility into the execution stack.

To run large-parameter local models safely on consumer hardware (e.g. RTX 4070 Ti SUPER 16GB) without Out-of-Memory (OOM) crashes, GoodQ4All implements strict VRAM and execution controls:

Model Lifecycle Manager: A specialized context manager (lib/model_lifecycle.py

) that audits free VRAM using PyTorch andnvidia-smi

before models, dynamically evicting idle networks from GPU memory.Local Agent Stack (: Gated LLM reasoning and local tool execution through zero-dependency policy enforcement middleware, schemas, policies, and contracts dynamically from the version-controlledMiniAgentClient

)agents/stack/

directory.Endpoint Fallback Orchestration: Automatically falls back from the primary local vLLM server (prefer_speed

, running Qwen2.5) to a local Ollama service (prefer_quality

, running Phi-4) or a CPU-safe model variant when VRAM thresholds are breached.

High-precision 32-bit floating point embeddings are persisted in Qdrant and FAISS. For rapid candidate filtering, GoodQ4All uses TurboQuant—an SQLite sidecar caching technology employing Lloyd-Max Polar Quantization and Johnson-Lindenstrauss residual projections.

Performance: Achieves sub-millisecond candidate pre-filtering.Accuracy: 100% search accuracy is maintained by performing the final rank scoring on the uncompressed raw float32 vectors.

Note

Hybrid Precision Caching Model: GoodQ4All uses an additive sidecar vector cache architecture. High-precision 32-bit floating point (float32

) embeddings remain the authoritative truth of the system, stored in Qdrant and FAISS. Performance-oriented query pre-filtering is handled via lightweight TurboQuant fields (Lloyd-Max Polar Quantization + Johnson–Lindenstrauss residual corrections) stored in SQLite. This ensures zero data loss, guarantees rollback capability, and cuts memory usage.

The pipeline dynamically adjusts its computational needs to match your system specs:

BASELINE

(CPU-safe): Fully operational, offline-ready execution on standard CPU hardware. Bypasses GPU requirements gracefully.GPU_ENHANCED

: Activates local NVIDIA GPU (CUDA 12.1) and WSL2 accelerated audio processing paths for fast, high-volume ingestion.

GoodQ4All compiles the isolated Python environment, the Qdrant database, and perception libraries into a single executable wrapper:

Download and run GoodQ4All_Setup_1.0.0.exe

. - Launch GoodQ4All from the desktop shortcut. - Open the local Retro Memory Explorer dashboard athttp://127.0.0.1:30000/ui/retro_console_v1/

. - Drag-and-drop video/audio files onto the yellow-dotted Upload Pad to begin automatic ingestion.

If you are developing, customizing the pipeline, or running from source:

Developer Source Setup Steps (Advanced)

GoodQ4All ships with two local operator console variants:

Classic Operator Console(served at/ui/operator_console_v1/

): Exposes the current scope strip, flight deck, proof/evidence status, recurrence reports, and video inventories.Retro Memory Explorer (v1.4.7)(served at/ui/retro_console_v1/

): A premium cyber-CRT dashboard featuring a four-panel resizable/collapsible layout with floating restore tabs, an entity co-occurrence graph with dynamic zoom and flight transitions, an Inspector panel containing keyframe image/transcript views, and bidirectional timeline checklists.

Guided demo: docs/guides/DEMO.md
First run: docs/guides/FIRST_RUN.md
Install: docs/guides/install/INSTALL.md
Quickstart: docs/guides/install/QUICKSTART.md
Clean memory start: docs/guides/CLEAN_MEMORY_START.md
Data Privacy: docs/guides/general/PRIVACY.md
Architecture: docs/architecture/SYSTEM_ARCHITECTURE.md
Memory Storage: docs/architecture/MEMORY_STORAGE.md
Hybrid Caching: docs/architecture/TURBOQUANT_HYBRID_CACHING.md
Current Agent State: docs/agent/CURRENT_STATE.md

MIT. See LICENSE.

source & further reading

github.com — original article

Build a full knowledge graph locally with any video

Run your AI side-project on zahid.host