AI Engineer in Vancouver, BC — Production AI, Built in the Open


Source: blog.r-lopes.com · Published: 2026-06-05T14:00Z · Topic: artificial-intelligence


Rafael Lopes, a production AI engineer in Vancouver, BC, builds and ships hybrid-RAG pipelines, distributed LLM inference across four compute architectures, and a sovereign research copilot on a self-hosted homelab, documenting everything in the open. His platform serves live traffic from a K3s cluster with no cloud compute, using GitOps and Cloudflare Tunnel for edge security.


What I Build

I'm Rafael Lopes — "Rafa" — a production AI engineer based in Vancouver, British Columbia. I don't write about AI from the sidelines; I ship it. The systems below all serve live traffic from a self-hosted cluster in one room:

A hybrid-RAG pipeline over 69,000+ curated technical chunks (BM25 + TF-IDF + weighted RRF + cross-encoder rerank), with an automated quality gate that strips fabricated quotes before anything publishes.

Distributed LLM inference across four compute architectures (ARM, AMD ROCm, NVIDIA CUDA, and Apple Silicon), pooling memory over the llama.cpp RPC protocol for models too large for one GPU.

exaflop.ca, a sovereign research copilot for Canadian HPC: every byte of the inference path stays local, with a live ledger proving zero foreign hops per query.
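The weighted RRF step that fuses the BM25 and TF-IDF rankings can be sketched as follows. This is a minimal illustration, not the pipeline's actual code: the function name `weighted_rrf`, the chunk ids, the weights, and the smoothing constant `k=60` (a commonly used default) are all assumptions.

```python
from collections import defaultdict


def weighted_rrf(rankings, weights, k=60):
    """Fuse several best-first rankings with weighted reciprocal rank fusion.

    rankings: dict mapping a retriever name to a list of chunk ids, best first.
    weights:  dict mapping a retriever name to its weight (defaults to 1.0).
    k:        smoothing constant; 60 is a common choice, assumed here.
    """
    scores = defaultdict(float)
    for name, ranked_ids in rankings.items():
        weight = weights.get(name, 1.0)
        for rank, chunk_id in enumerate(ranked_ids, start=1):
            # Each retriever contributes weight / (k + rank) for a chunk.
            scores[chunk_id] += weight / (k + rank)
    # Highest fused score first; ties broken by id for determinism.
    return sorted(scores, key=lambda cid: (-scores[cid], cid))


# Hypothetical rankings from two retrievers.
rankings = {
    "bm25": ["c3", "c1", "c7"],
    "tfidf": ["c1", "c9"],
}
weights = {"bm25": 1.0, "tfidf": 0.5}
fused = weighted_rrf(rankings, weights)
```

Here `c1` ends up first because both retrievers return it, even though BM25 alone ranks `c3` higher. In a pipeline like the one described, the fused list would then go to the cross-encoder for reranking.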

The Stack

The whole platform is documented, not described:

How the briefs are made: the retrieval → synthesis → quality-gate → publish pipeline, with the real numbers.

The infrastructure: a four-architecture K3s homelab, GitOps via Argo CD, and Cloudflare Tunnel + Zero Trust at the edge, with no cloud compute.

A from-scratch RAG build: the actual BM25/TF-IDF/RRF code and measured retrieval quality.
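For readers who have not seen BM25 written out, here is a minimal from-scratch sketch of the scoring function. It is not the code from the linked write-up: the function name `bm25_scores`, the parameters `k1=1.5` and `b=0.75`, the Lucene-style IDF with `+1` inside the logarithm, and the toy documents are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    """Score each tokenized document against the query with BM25.

    docs is a list of token lists; returns one float per document.
    """
    if not docs:
        return []
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for d in docs:
        df.update(set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            # Lucene-style IDF, which stays non-negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Toy corpus, made up for this example.
docs = [
    "k3s control plane on a raspberry pi".split(),
    "argo cd syncs the cluster from git".split(),
    "llama cpp rpc pools gpu memory".split(),
]
scores = bm25_scores(["gpu", "memory"], docs)
best = max(range(len(docs)), key=lambda i: scores[i])
```

Only the third document contains the query terms, so it is the only one with a non-zero score. A real build would add tokenization, stop-word handling, and an inverted index rather than scanning every document.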

The Daily Brief

Every weekday I publish a cross-domain engineering brief — AI, web performance, system design, security, and the career arc — synthesized from the corpus, cited to source, and shipped through the same quality gate. The archive is the proof of consistency: nobody fakes a dated, cited, cross-domain brief every working day.
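One way a quality gate could strip fabricated quotes is to keep a quoted span only if it appears verbatim in the retrieved source chunks. The sketch below assumes that approach; the helper names, the sentence-level removal policy, and the sample text are hypothetical and do not describe the gate actually running on the site. It handles straight double quotes only.

```python
import re

QUOTE_RE = re.compile(r'"([^"]+)"')


def normalize(text):
    """Lowercase and collapse whitespace so matching ignores layout."""
    return " ".join(text.lower().split())


def strip_unverified_quotes(draft, source_chunks):
    """Drop every sentence that quotes text not found in any source chunk.

    Returns (cleaned_draft, removed_sentences).
    """
    corpus = [normalize(chunk) for chunk in source_chunks]
    kept, removed = [], []
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", draft.strip()):
        quotes = QUOTE_RE.findall(sentence)
        verified = all(
            any(normalize(quote) in chunk for chunk in corpus)
            for quote in quotes
        )
        (kept if verified else removed).append(sentence)
    return " ".join(kept), removed


# Made-up source chunk and draft for illustration.
sources = ["The control plane runs on a Raspberry Pi and serves live traffic."]
draft = (
    'The post says the "control plane runs on a Raspberry Pi". '
    'It also claims "latency dropped by 90%". '
    "No cloud compute is used."
)
clean, removed = strip_unverified_quotes(draft, sources)
```

In this example the second sentence is removed because its quote does not occur in any source chunk, while the first quote survives. Dropping the whole sentence is a conservative choice; a gate could instead rewrite the sentence without the quote or send the draft back for regeneration.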

The Infrastructure

No managed Kubernetes, no hosted CI, no hyperscaler in the data path. A Raspberry Pi runs the K3s control plane; an AMD-ROCm workstation does the GPU heavy lifting; an x86 box self-hosts GitLab and the registry; a Mac M3 Max joins as an RPC peer. Every change goes git → CI → Argo CD → live. The platform that runs this blog is the same one that runs the research copilot.
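To make the idea of pooling memory across peers concrete, here is a simple heuristic that assigns whole model layers to devices in proportion to their free memory. It is an illustrative sketch only, not a description of how llama.cpp places layers internally; the function name `split_layers`, the peer names, the memory sizes, and the layer count are all made up.

```python
def split_layers(n_layers, free_mem_gib):
    """Assign whole layers to devices in proportion to free memory.

    Uses the largest-remainder method so the counts always sum to n_layers.
    free_mem_gib maps a device name to its free memory in GiB.
    """
    total = sum(free_mem_gib.values())
    exact = {name: n_layers * mem / total for name, mem in free_mem_gib.items()}
    # Start with the integer part of each device's proportional share.
    alloc = {name: int(share) for name, share in exact.items()}
    leftover = n_layers - sum(alloc.values())
    # Hand remaining layers to the devices with the largest fractional parts.
    by_remainder = sorted(exact, key=lambda n: exact[n] - alloc[n], reverse=True)
    for name in by_remainder[:leftover]:
        alloc[name] += 1
    return alloc


# Hypothetical peers and free-memory figures.
peers = {"rocm-workstation": 24, "mac-m3-max": 40}
plan = split_layers(80, peers)
```

With these invented numbers the workstation takes 30 layers and the Mac takes 50. A real placement would also have to budget for the KV cache and for network latency between peers, which this sketch ignores.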

Available For

Vancouver-based and remote-friendly. Open to:

Consulting on production RAG, LLM inference, and AI platform/SRE work.

Speaking on sovereign/local-first AI, web performance for AI consumers, and homelab-scale inference.

Collaboration with teams shipping real AI infrastructure who want the receipts, not the hype.

Teaching by doing — production AI, not commentary. The system is the proof.

FAQ

Who is the AI engineer in Vancouver behind this site? Rafael Lopes ("Rafa") — a production AI engineer based in Vancouver, British Columbia. He builds and ships RAG pipelines, distributed LLM inference, and a sovereign research copilot on a self-hosted homelab, and documents the results in the open.

What does a production AI engineer do? Builds AI systems that serve real traffic — retrieval pipelines, LLM inference, quality gates, and the platform/SRE work to run them — rather than writing about AI from the sidelines. Here, every claim links to a live system or a measured number.

What AI does Rafael Lopes build? Hybrid retrieval (BM25 + TF-IDF + weighted RRF + cross-encoder rerank), distributed LLM inference across four compute architectures over the llama.cpp RPC protocol, and exaflop.ca — a sovereign, local-first research copilot for Canadian HPC.

Where can I read more? The daily cross-domain engineering brief, the how-it-works pipeline, and the infrastructure write-up — all linked below and at blog.r-lopes.com.

source & further reading

blog.r-lopes.com (original article)


Read original on blog.r-lopes.com → blog.r-lopes.com/posts/ai-engineer-vancouver
