[MISSION] RAG system for an endangered spoken language — 10 weeks — full IP transfer

Source: discuss.huggingface.co · Published: 2026-06-26T16:40Z · Topic: natural-language-processing

A client is seeking an experienced NLP/LLM engineer to build the first RAG-based localization engine for a low-resource South American language within 10 weeks, with full IP transfer and a budget of €5,000–€10,000.

Hi, I’m looking for an experienced NLP/LLM engineer for a serious, well-scoped project: building the first RAG-based localization engine for a low-resource language spoken in South America.

The project is built on a proprietary corpus (pedagogical content, dictionary, proverbs, orthographic rules) developed over 4 years. Architecture, endpoints, quality criteria and deliverables are fully specified in a detailed technical brief (available under NDA).

**Technical scope (MVP — 10 weeks)**

- RAG pipeline on a low-resource language corpus (LangChain or LlamaIndex)
- Multilingual embedding model (multilingual-e5 or equivalent)
- Vector DB: Pinecone, Weaviate or Supabase pgvector, latency < 500 ms
- Modular prompt layer: 6 use-case templates (translate, educate, dub, subtitle, localize, campaign)
- Multi-tenant B2B SaaS infrastructure: strict data isolation, JWT auth, configurable quotas
- REST API + Swagger documentation
- Admin interface for glossary management (no-dev updates)
- Offline SQLite bundle for React Native mobile app
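
For candidates wondering what the modular prompt layer might look like, here is a minimal Python sketch under stated assumptions. Only the six use-case names come from the scope list; the template wording, the `PromptTemplate` class, the `build_prompt` helper and the glossary format are hypothetical illustrations, not the design in the technical brief.

```python
from dataclasses import dataclass
from string import Template

# The six use cases named in the scope list.
USE_CASES = ("translate", "educate", "dub", "subtitle", "localize", "campaign")

# Hypothetical task instructions; the real wording would come from the brief.
_TASKS = {
    "translate": "Translate the input into the target language.",
    "educate": "Write a short lesson based on the input.",
    "dub": "Adapt the input as spoken dialogue for dubbing.",
    "subtitle": "Condense the input into subtitle lines.",
    "localize": "Localize the input for the target audience.",
    "campaign": "Rewrite the input as campaign messaging.",
}

# Shared body: every use case receives the retrieved passages and the glossary.
_SHARED = (
    "Retrieved corpus passages:\n$context\n\n"
    "Glossary (must be respected):\n$glossary\n\n"
    "Input:\n$source_text\n"
)


@dataclass(frozen=True)
class PromptTemplate:
    use_case: str
    instruction: Template

    def render(self, source_text: str, passages: list[str], glossary: dict[str, str]) -> str:
        # Format retrieved passages as a bullet list and glossary pairs as lines.
        context = "\n".join(f"- {p}" for p in passages)
        terms = "\n".join(f"{src} => {tgt}" for src, tgt in sorted(glossary.items()))
        return self.instruction.substitute(
            context=context, glossary=terms, source_text=source_text
        )


# One registry entry per use case, so a template can be changed in isolation.
REGISTRY = {
    name: PromptTemplate(name, Template(f"Task: {_TASKS[name]}\n\n{_SHARED}"))
    for name in USE_CASES
}


def build_prompt(use_case: str, source_text: str, passages: list[str], glossary: dict[str, str]) -> str:
    """Look up the template for a use case and fill it with retrieved context."""
    try:
        template = REGISTRY[use_case]
    except KeyError:
        raise ValueError(f"unknown use case: {use_case!r}") from None
    return template.render(source_text, passages, glossary)
```

Keeping the templates in a registry keyed by use case means the retrieval step and the LLM client (Anthropic or OpenAI) can stay unchanged when a template is edited or a seventh use case is added.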

**Profile required**

- Proven RAG experience on low-resource or multilingual corpora
- Multi-tenant SaaS architecture references
- Modular prompt engineering in production (Anthropic or OpenAI API)
- Full IP transfer to client (non-negotiable)
- NDA before corpus access
- Budget: €5,000–€10,000 depending on experience
- Start: within 2–3 weeks

If this matches your profile, please reply with concrete references for RAG on low-resource languages, multi-tenant SaaS architecture, and modular prompt engineering in production.
