# Making of Aantraa

> Source: <https://dev.to/shreyvijayvargiya/making-of-aantraa-3475>
> Published: 2026-06-26 15:20:30+00:00

**aantraa.site** — AI audio & video translation, caption generator, and viral shorts cutter.

I run a small YouTube channel. I'm not a full-time content creator, but YouTube is a solid platform to gain traffic for your online work, business, project, or idea.

**Aantraa** is what I built in a week. The main concept is simple:

At that time, only three features were needed, so website development wasn't the heavy lift. The real work was building APIs, backend infrastructure to integrate AI into video, and dealing with heavy storage.

Aantraa is heavily dependent on AI APIs — we need reliable infrastructure for LLM providers.

OpenRouter, Portkey, Vercel AI SDK labs, and individual APIs for Anthropic, Deepseek, and OpenAI are solid options.

I prefer **OpenRouter** for Aantraa for one reason: **multiple model support** — it's easy to pick the cheapest capable model for each job. Easy to integrate, strong community support, free model access, and more.

AI LLM APIs are needed at almost every stage in the backend:

Each layer needs heavy AI context and prompt engineering. **Loop engineering** is the trend here — and it's required for aantraa.

For example, **video translation** works in multiple connected steps:

Each step depends on the previous one, which makes production debugging hard when something breaks.

**Solution:** Track each process — usage tokens, estimated time, errors, and response metadata.

The same pattern applies to audio translation, viral clip cutter, and caption generator.

Local API development is manageable until you ship to production.

| Layer | Choice |
|---|---|
| Framework | Hono.js |
| Backend hosting | Vercel Edge or Fly.io |
| DevOps | Docker, simple Git CI/CD |
| Database | Firebase / Supabase |
| Storage | UploadThing |

I found **UploadThing** as a practical alternative to AWS S3 and Firebase/Supabase storage for file uploads. It provides client and server SDKs to upload files quickly (5 MB per chunk on the free plan).

We need storage heavily because every AI layer doesn't keep its own memory — every generated audio/video file must land in storage.

**FFmpeg** is essential for video and audio work, but it has limitations on serverless functions and Vercel Edge. That pushed us toward **Fly.io**, Railway, or Render for heavier media workloads.

Aantraa supports **90+ languages** for video and audio translation.

AI translates scripts, text, on-screen text, and video context well — it needs to understand the video through the script and each frame. **FFmpeg** helps with that pipeline.

The flow:

Each step needs debugging, prompt engineering, and FFmpeg integration.

Finally, the output uploads to storage and returns a URL to the client for download and playback.

Aantraa isn't only video translation. As a creator, I also wanted to turn blog posts or text into audio — podcast-style listening.

The **audio translation** tool covers:

That makes aantraa a supporting platform: one recording → 90+ language MP3s, ready for one-click sharing.

The viral shorts feature converts long-form YouTube video into short clips you can upload directly to your channel.

It sounds simple, but production needs FFmpeg tuning, AI context limits, and file-size guardrails — videos over ~10 MB cost more time and money to process.

See **10+ translated videos** in Spanish, Hindi, Bengali, Gujarati, Marathi, Tamil, French, English, Japanese, Chinese, and more on our [examples page](https://aantraa.site/examples).

90+ languages make aantraa a universal, global platform.

The **first version is live**. We also offer **APIs for business agencies and startup teams** — reach out via [contact](https://aantraa.site/contact) if you're interested.

**Links**

Try the product and share your feedback — early signups get **1 free minute** of AI translation credit.

Cheers,

**Shrey**

Aantraa · [aantraa.site](https://aantraa.site)
