I Built a Search Engine That Understands Meaning — in ~150 Lines, Zero API Keys

wpnews.pro

cd /news/natural-language-processing/i-built-a-search-engine-that-underst… · home › topics › natural-language-processing › article

[ARTICLE · art-26538] src=dev.to ↗ pub=2026-06-13T22:36Z topic=natural-language-processing verified=true sentiment=↑ positive

I Built a Search Engine That Understands Meaning — in ~150 Lines, Zero API Keys

A developer built a semantic search engine in about 150 lines of code using the all-MiniLM-L6-v2 embedding model and pgvector on Postgres, requiring no API keys. The system matches queries like "animals that live in the ocean" to articles such as "Blue whale" and "Coral reef" based on meaning rather than keywords.

read3 min views22 publishedJun 13, 2026

Type "animals that live in the ocean" into a normal search box and it hunts

for the words animals, live, ocean. An article titled "Blue whale" that

never uses any of those words? Missed.

Today we fix that. We'll build a search engine that matches on meaning, so

"animals that live in the ocean" surfaces Blue whale and Coral reef —

no shared keywords required.

The whole thing is a few hundred lines, runs on free tooling, and needs no API key. The two ideas you'll walk away understanding are the foundation under every

This is Day 45 of my TechFromZero series — one new technology every day, built

from scratch, every line explained.

An embedding is a list of numbers that captures what a piece of text means.

A good embedding model places texts about similar ideas close together in that

number-space, even when they share no words:

Our model, all-MiniLM-L6-v2

, turns any text into 384 numbers. It runs

locally through Transformers.js —

downloads once (~25 MB), then costs nothing and sends nothing to the cloud.

import { pipeline } from "@xenova/transformers";

const extractor = await pipeline("feature-extraction", "Xenova/all-MiniLM-L6-v2");

// pooling:"mean" -> one vector per sentence; normalize:true -> cosine-ready
const output = await extractor("Blue whale", { pooling: "mean", normalize: true });
const vector = Array.from(output.data); // [0.013, -0.05, ... ] 384 of them

You could keep a separate vector database. But if your data already lives in

Postgres, pgvector adds a real vector

column type and the distance math right inside Postgres. One database, no extra

bill.

CREATE EXTENSION IF NOT EXISTS vector;   -- turn pgvector on

CREATE TABLE articles (
  id        SERIAL PRIMARY KEY,
  title     TEXT,
  summary   TEXT,
  embedding vector(384)                  -- <-- 384 must match the model
);

-- an approximate-nearest-neighbour index so search stays fast at scale
CREATE INDEX ON articles USING hnsw (embedding vector_cosine_ops);

The official pgvector/pgvector:pg16

Docker image has the extension baked in, so

local setup is one line:

docker compose up -d

We need a pile of text. Wikipedia's REST API is public and keyless — its

/page/random/summary

endpoint hands back a clean title + extract. We pull a few

hundred, embed each, and insert the row with its vector:

const vector = await embed(`${a.title}. ${a.summary}`);
await pool.query(
  `INSERT INTO articles (title, url, summary, embedding)
   VALUES ($1, $2, $3, $4::vector)`,
  [a.title, a.url, a.summary, `[${vector.join(",")}]`]
);

(pgvector accepts a vector as the text literal [0.1,0.2,...]

— that's the

$4::vector

cast.)

Here's the payoff. Embed the user's query with the same model, then let

Postgres rank rows by how close their vectors are. The magic operator is <=>

—

cosine distance. Smaller means closer; 1 - distance

gives a tidy 0–1

similarity score.

const queryVec = `[${(await embed(userQuery)).join(",")}]`;

const { rows } = await pool.query(
  `SELECT title, url, summary,
          1 - (embedding <=> $1::vector) AS similarity
   FROM articles
   ORDER BY embedding <=> $1::vector      -- nearest neighbours first
   LIMIT 5`,
  [queryVec]
);

That's it. No keyword index, no synonyms list, no stemming rules. The model

already learned that whales live in oceans.

Searching "famous battles in history" in my 300-article corpus returns

Napoleonic engagements and ancient sieges — articles that never contain the word

"famous". Searching "how the brain works" surfaces neuroscience pages that say

"neuron" and "cortex", not "brain works".

animals that live in the ocean
  92.1%  Blue whale
  88.4%  Coral reef
  85.0%  Sea otter

This tiny project is the core of every "chat with your docs" / "AI that knows

your data" feature. Retrieval-Augmented Generation (RAG) is literally:

Get embeddings + vector search, and RAG stops being mysterious.

git clone https://github.com/dev48v/pgvector-from-zero.git
cd pgvector-from-zero
npm install
cp .env.example .env
docker compose up -d
npm run seed
npm run dev      # http://localhost:3000

Every file has STEP headers and WHY comments, and the commits are ordered one

concept at a time — clone it and read them top to bottom.

Repo: https://github.com/dev48v/pgvector-from-zero

This was Day 45 of TechFromZero. A new technology every day, built from scratch.

Follow along — tomorrow's pick lands next.

source & further reading

dev.to — original article OpenWorker: Andrew Ng's Local-First AI Coworker, Explained for Developers J-space in practice: using Anthropic's Jacobian lens to decide what an LLM can forget Top AI Papers on Hugging Face - 2026-07-29

~/api · this article 200

$curl api.wpnews.pro/v1/news/i-built-a-search-engine-…

Read original on dev.to → dev.to/dev48v/i-built-a-search-engine-that-under…

mentioned entities

Transformers.js

pgvector

Postgres

Wikipedia

all-MiniLM-L6-v2

Xenova

metadata

slugi-built-a-search-engine-that-understands-meaning-in-150-lines-zero-api-keys

topic#natural-language-processing

secondary2 topics

sentimentpositive

canonicaldev.to

navigation

← prevDesigning Software for Software …

next →From a Simple Web App to a Produ…

── more in #natural-language-processing 4 stories · sorted by recency

kdnuggets.com · 29 Jul · #natural-language-processing

5 Must-Read Resources for Mastering Small Language Models

dev.to · 29 Jul · #natural-language-processing

Scanning agent transcripts for secrets, without sending them anywhere

dev.to · 29 Jul · #natural-language-processing

Why We Use Arithmetic, Not AI, to Check Our AI

discuss.huggingface.co · 29 Jul · #natural-language-processing

Smart matching system using AI

── more on @transformers.js 3 stories trending now

wpnews · 16 Jul · #artificial-intelligence

Women entrepreneurs are less likely to leverage AI—but more likely to benefit from it

wpnews · 28 Jul · #artificial-intelligence

How Claude Code and VS Code turned Anthropic from a safety lab into a developer phenomenon

wpnews · 28 Jul · #large-language-models

How to Download and Run Kimi K3 Open Weights

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required