I built a local-first movie recommender with Corrective-RAG (cited explanations, hybrid retrieval, runs entirely on Ollama)

wpnews.pro

cd /news/large-language-models/i-built-a-local-first-movie-recommen… · home › topics › large-language-models › article

[ARTICLE · art-13926] src=dev.to ↗ pub=2026-05-25T22:50Z topic=large-language-models verified=true sentiment=↑ positive

I built a local-first movie recommender with Corrective-RAG (cited explanations, hybrid retrieval, runs entirely on Ollama)

A developer built a local-first movie recommendation system using a Corrective-RAG pipeline that runs entirely on Ollama. The system employs query expansion at ingestion time rather than query time, generating 3-5 pseudo-queries per movie to improve scalability. On an M3 Mac with 36GB RAM, the system achieves approximately 90-second query latency with llama3, dropping to 15-20 seconds with llama3.2:1b.

read1 min views12 publishedMay 25, 2026

Hey — sharing a project I've been building for the last

few months. It's a movie recommendation system that runs entirely on

your laptop using Ollama, with a Corrective-RAG pipeline.

Why I built it: existing streaming platforms only know what you

watched on them. Netflix can't see my Prime history, none of them know

about cinema watches. Wanted one system that learns from all of it.

Stack:

The interesting design choice was query expansion at INGEST time instead

of query time. The enrichment LLM generates 3-5 pseudo-queries per movie

and embeds them alongside the plot. Catalogues are bounded; user queries

aren't, so paying the LLM cost once per movie scales better than once

per query.

Latency on M3 / 36GB / Ollama llama3: ~90s/query (filter_extract +

explain dominate). llama3.2:1b drops to ~15-20s. Hosted models ~5-10s.

Code + setup: github.com/meetgrewal7793-creator/personal-movie-recommender

The 7-stage architecture diagram is in the README. Feedback welcome —

especially on the grader prompt calibration, which I had to relax for

local-LLM defaults because llama3 graders over-flag results as weak.

source & further reading

dev.to — original article Best AI Agent Governance Tools in 2026: A Layer-by-Layer Guide What I learned building an AI video background changer AI Agent Runtime Policy: Stop Dangerous Tool Calls Before They Execute

~/api · this article 200

$curl api.wpnews.pro/v1/news/i-built-a-local-first-mo…

Read original on dev.to → dev.to/a_aesthetic_dbd654c063b47/i-built-a-local…

mentioned entities

Ollama

Netflix

Prime

llama3

llama3.2

meetgrewal7793-creator

metadata

slugi-built-a-local-first-movie-recommender-with-corrective-rag-cited-explanations

topic#large-language-models

secondary4 topics

sentimentpositive

canonicaldev.to

navigation

← prevIs the Palantir Valuation Debate…

next →Mec builds first High-NA EUV-fab…

── more in #large-language-models 4 stories · sorted by recency

byteiota.com · 10 Jul · #large-language-models

Ollama Raises $65M Series B: What Changes for Developers

tensorsharp.ai · 10 Jul · #large-language-models

Show HN: TensorSharp: Open-Source Local LLM Inference Engine

github.com · 1 Jul · #large-language-models

Ragit – chat with any folder of documents using a local LLM

dev.to · 27 Jun · #large-language-models

Building a Local-First Voice Copilot for the Shell with HoldSpeak and Ollama

── more on @ollama 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 8 Jul · #artificial-intelligence

Anthropic's "J-lens" reveals workspace in Claude mirrors theory of consciousness

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required