How many times have you wanted to search your private PDFs, notes, or code files using AI, but hesitated?
We all want the power of AI search. But up sensitive documents to external servers is a big privacy risk.
What if you could build a complete search engine that runs 100% inside your browser? No servers, no APIs, and no cost.
At Utilora, we built exactly this. We call it Personal RAG. Here is how we made it work, and how you can do it too.
Retrieval-Augmented Generation (RAG) usually requires a backend database, python servers, and API keys. To make it run entirely on the client side, we
combined three modern browser technologies:
Here is the exact flow of how a document is processed:
[ Your File ] β [ Client Parser ] β [ Chunking ] β [ Local ML Embedding ] β [ OPFS Storage ]
You cannot store millions of text numbers in normal browser storage like LocalStorage. It is too slow and has a 5MB limit.
Instead, we use the Origin Private File System (OPFS). It gives web
apps a private, highly optimized filesystem. Here is a simple look at how we write vector indexes to OPFS:
// Access the private root directory
const root = await navigator.storage.getDirectory();
// Create or access our index file
const fileHandle = await root.getFileHandle("vector-index.db", { create: true });
// Create a high-speed write stream
const accessHandle = await fileHandle.createWritable();
await accessHandle.write(new TextEncoder().encode(JSON.stringify(myVectorData)));
await accessHandle.close();
We use Comlink by Google to easily communicate with a background Web Worker:
// In your main component
import * as Comlink from "comlink";
const worker = new Worker(
new URL("./rag-indexer.worker.ts", import.meta.url),
{ type: "module" }
);
const localIndexer = Comlink.wrap(worker);
// Run indexer in the background
await localIndexer.processAndEmbedFile(myUploadedFile);
Building with zero backend constraints completely changes how you think about software:
β’ True Privacy: Privacy is not a text policy on a page. It is hardcoded into the architecture. Since there is no backend, we cannot see your files even if
we wanted to.
β’ Completely Free: You do not pay for API keys, vector databases, or server hosting. The user's computer does all the work.
β’ Instant Offline Access: Once the page loads, you can turn off your internet and it still works.
If you want to see this in action, come check it out on Utilora https://utilora.com (our free, open collection of local web utilities).
Drag in a PDF, let it index, and ask questions. Your data never leaves your screen.
Have you built anything using local browser models? Let's chat in the comments below!