54/60 Days System Design Questions

wpnews.pro

cd /news/machine-learning/54-60-days-system-design-questions · home › topics › machine-learning › article

[ARTICLE · art-43707] src=dev.to ↗ pub=2026-06-29T16:20Z topic=machine-learning verified=true sentiment=· neutral

54/60 Days System Design Questions

A developer describes a common problem with RAG pipelines: data drift causes stale search results even when no code changes are made. They present four solutions for keeping a 400GB FAISS index fresh: scheduled full rebuild, incremental upserts, embedding version registry, and approximate staleness detection. The post asks readers to choose the best approach given a 50M-chunk corpus and model updates every six weeks.

read1 min views1 publishedJun 29, 2026

You built a RAG pipeline. Works great in dev.

6 months later, your users complain: "The search results are garbage."

You haven't changed a line of code.

Here's what happened:

Your product evolved. New features, new docs, new support tickets. The data drifted — but your embedding index didn't.

Now you're serving a 400GB FAISS index that was last rebuilt in January. Your chunks are stale. Your nearest-neighbor results point to deprecated docs. Your LLM is confidently hallucinating from outdated context.

You need to fix this. 4 engineers each propose a solution:

A) Scheduled full rebuild

Every Sunday, re-embed the entire corpus from scratch. Replace the index atomically. Slow (4h+ at scale), expensive, but always fresh.

B) Incremental upserts + soft delete

On every document change, re-embed only the affected chunks. Mark deleted chunks as tombstoned. Keep a version field on each vector. Index size grows over time; compact quarterly.

C) Embedding version registry + hot swap

Track which embedding model version produced each vector. When the model drifts (fine-tuned or upgraded), invalidate the mismatched vectors and rebuild only those. Two indexes run in parallel during migration. Route traffic by model version.

D) Approximate staleness detection

Run a nightly job that samples 1% of your corpus, re-embeds it, and measures cosine distance against the stored vector. If drift exceeds a threshold, trigger a full rebuild. Otherwise, skip it. Cheap monitoring, reactive rebuilds.

Real constraint: your corpus is 50M chunks. Full rebuild = 4 hours + ~$800 in embedding API cost. You deploy model updates every 6 weeks.

Pick one — A, B, C, or D — and tell me why. Full breakdown in the comments.

source & further reading

dev.to — original article Prompt Engineering That Actually Ships: A Practical Guide Vidilearn: AI Knowledge Ingestion & Retrieval Gateway for LLMs, Agents, and MCP Servers Building an Agentic SOC Analyst with Microsoft Sentinel, Azure Log Analytics, and Gemini

~/api · this article 200

$curl api.wpnews.pro/v1/news/54-60-days-system-design…

Read original on dev.to → dev.to/thejoud1997/5460-days-system-design-quest…

mentioned entities

FAISS

LLM

metadata

slug54-60-days-system-design-questions

topic#machine-learning

secondary3 topics

sentimentneutral

canonicaldev.to

navigation

← prevThe API Stack Every Modern Devel…

next →Google Meet’s Gemini-powered ‘Ta…

── more in #machine-learning 4 stories · sorted by recency

dev.to · 29 Jun · #machine-learning

Vidilearn: AI Knowledge Ingestion & Retrieval Gateway for LLMs, Agents, and MCP Servers

dev.to · 29 Jun · #machine-learning

MiniMax: What It Actually Means to Run on This Model

devclubhouse.com · 29 Jun · #machine-learning

How a Database Schema Error Triggered an Expensive AI Retry Storm

github.com · 29 Jun · #machine-learning

Show HN: KaliCart – an agent-readable catalog for WooCommerce (no API key)

── more on @faiss 3 stories trending now

wpnews · 28 May · #ai-startups

[AINews] Cognition raises $1B in $26B Series D

wpnews · 5 Jun · #ai-agents

Miasma Worm Targets AI Coding Agents via GitHub Repos

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required