Building Minyut: An Embeddable RAG Chatbot in One Script Tag

wpnews.pro

cd /news/large-language-models/building-minyut-an-embeddable-rag-ch… · home › topics › large-language-models › article

[ARTICLE · art-32151] src=dev.to ↗ pub=2026-06-18T05:50Z topic=large-language-models verified=true sentiment=↑ positive

Building Minyut: An Embeddable RAG Chatbot in One Script Tag

Minyut is an embeddable RAG chatbot that can be integrated via a single script tag, built to prevent AI hallucinations by constraining answers to uploaded content. The chatbot processes queries for websites built on Webflow, WordPress, Shopify, React, and plain HTML, using Supabase for storage and HuggingFace for embeddings. Minyut employs Shadow DOM for cross-site compatibility and offers fixed monthly pricing to avoid billing uncertainty.

read4 min views27 publishedJun 18, 2026

A client needed their customers to be able to query a 40-page policy document without reading through it.

We built the first version of what became Minyut in a weekend.

It used a basic embedding approach, answered from an OpenAI endpoint, and—in testing—confidently responded to questions that had no answer in the document at all.

It made things up.

Fluently.

Completely wrong.

That was the founding problem.

Every document chatbot we tested made things up. Not because the models were bad, but because they weren't constrained to answer only from the documents.

Minyut is built around a single architectural decision:

Every answer must come from uploaded content—or the chatbot says, "I don't know."

Everything else follows from that constraint.

Today, Minyut processes queries for chatbots embedded on Webflow sites, WordPress installs, Shopify stores, React apps, and plain HTML pages.

Documents are stored in Supabase's Mumbai region, and the widget can be embedded with a single script tag in under ten minutes.

Here's how it's built.

Standard AI chatbots answer from their training data.

For general knowledge, that's exactly what you want.

For support chatbots, legal documents, policy manuals, product specifications, and consultancy websites, accuracy becomes a liability issue rather than a convenience feature.

A chatbot that invents policy details isn't a support tool.

It's a liability.

The solution is Retrieval-Augmented Generation (RAG).

At query time:

If the answer isn't present in the uploaded documents, the chatbot says so.

The language model can only answer as well as the passages you retrieve.

Good retrieval is most of the battle.

Documents arrive as:

File limits:

After extraction, documents are chunked.

Each sentence became its own chunk.

Retrieval was precise but context disappeared.

Example:

Question: What is the refund window?

Retrieved:

Refunds are processed in 7 days.

Technically correct.

Practically useless.

Context improved.

Retrieval consistency did not.

Short and long paragraphs behaved very differently during similarity search.

Current strategy:

The overlap ensures sentences crossing chunk boundaries remain complete in at least one retrieved section.

For Minyut's document types, answer quality improved significantly.

Each chunk is embedded using:

sentence-transformers/all-MiniLM-L6-v2

via the HuggingFace Inference API.

The model generates:

Vectors are stored in:

inside Supabase.

The hard problem wasn't a script.

It was ensuring the widget worked everywhere.

Different host websites bring:

Our first approach used scoped CSS.

It failed repeatedly.

Examples:

The solution was Shadow DOM.

The widget creates a completely isolated DOM tree.

Host styles cannot leak in.

Widget styles cannot leak out.

const host = document.createElement('div');
document.body.appendChild(host);

const shadow = host.attachShadow({
  mode: 'open'
});

Everything lives inside the shadow root.

Style conflicts effectively disappear.

The widget is delivered as a single async script:

<script async src="https://minyut.com/widget.js"></script>

Advanced users can control behavior through:

window.__minyut__

including:

Minyut's stack is intentionally simple.

Handles:

Hosts:

Handles subscription billing.

Provides embedding generation.

Handles language model inference.

Documents are stored in:

Bring Your Own Key support allows users to connect:

Keys are encrypted using AES-256.

We never need to operate GPU infrastructure ourselves.

At Minyut's current scale, that's exactly the tradeoff we want.

The most common complaint about chatbot SaaS products isn't quality.

It's billing uncertainty.

Usage-based pricing creates anxiety.

A traffic spike should not become a surprise invoice.

Minyut uses fixed monthly plans.

Users receive notifications at:

Nothing fails silently.

Unlimited queries through the user's own OpenAI or Groq key.

We handle:

Users pay model providers directly.

We spent weeks comparing language models.

The bigger factor was chunk size and overlap.

Fix retrieval before optimizing inference.

Scoped CSS eventually breaks.

Shadow DOM doesn't.

Once we switched, CSS-related issues effectively disappeared.

The users who want BYOK are often the most engaged.

They build real systems.

Supporting them from the start avoids painful architectural changes later.

Minyut started as an attempt to solve a simple problem:

How do you build a chatbot that answers only from documents and refuses to invent information?

The answer ended up being a combination of:

The result is a chatbot that can be embedded on almost any website using a single script tag and answer only from uploaded content.

That's exactly what we set out to build.

No.

Minyut is designed specifically for document-grounded responses.

If the information isn't present in uploaded content, the chatbot says it doesn't know.

The widget has been tested on:

Because it uses Shadow DOM isolation, it works reliably across virtually any platform that permits custom JavaScript.

source & further reading

dev.to — original article Scaling AI Beyond the Monolith: Multi-Agent Coordination via Federated MCP Servers Databricks Lakebase: Give Your Agent a Branch, Not Your Production Database Dollars and rupees without Stripe: what building Skill Exchange's checkout taught me (PayPal + UPI)

~/api · this article 200

$curl api.wpnews.pro/v1/news/building-minyut-an-embed…

Read original on dev.to → dev.to/flux8labs/building-minyut-an-embeddable-r…

mentioned entities

Minyut

OpenAI

Supabase

HuggingFace

Webflow

WordPress

Shopify

Groq

metadata

slugbuilding-minyut-an-embeddable-rag-chatbot-in-one-script-tag

topic#large-language-models

secondary4 topics

sentimentpositive

canonicaldev.to

navigation

← prevVulnerabilities and exploits: wh…

next →OpenAI's Sam Altman postpones vi…

── more in #large-language-models 4 stories · sorted by recency

authoryze.ai · 2 Aug · #large-language-models

Show HN: Authoryze- payment controls for AI agents

dev.to · 2 Aug · #large-language-models

Building an LLM API Gateway in Node.js

ycrootaccess.com · 2 Aug · #large-language-models

Boris Cherny on Trying to Get Claude Code to Rewrite the Claude App

dev.to · 2 Aug · #large-language-models

Scaling AI Beyond the Monolith: Multi-Agent Coordination via Federated MCP Servers

── more on @minyut 3 stories trending now

wpnews · 1 Aug · #ai-products

OpenAI Atlas Shuts Down August 9: Migration Guide

wpnews · 2 Aug · #artificial-intelligence

I Ran 8 AI APIs Through the Same 50 Prompts — Here's the Real Cost Breakdown

wpnews · 2 Aug · #developer-tools

Agent-Browser – Browser Automation for AI

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required