# Building an AI task generator for Vikunja that doesn't trust itself

> Source: <https://dev.to/yooshyasha/building-an-ai-task-generator-for-vikunja-that-doesnt-trust-itself-ie7>
> Published: 2026-05-31 10:44:15+00:00

I self-host [Vikunja](https://vikunja.io/) to organize my projects, and I kept hitting the same chore: turning a vague idea ("set up CI for a Kotlin service") into a clean list of individual tasks, by hand, every time.

LLMs are obviously good at this kind of decomposition. But I didn't want an AI that silently writes a pile of tasks into my tracker and hopes for the best. I wanted it to *propose*, and then get out of my way so I can approve.

So I built **Trof** — an AI task generator for Vikunja with a human-in-the-loop step baked in. It's open source and self-hosted. This post is about how it works and a few decisions I made along the way.

Why "Trof"?It's named after the Trophy active protection system on the Merkava tank, which intercepts incoming threats before they reach the hull. Trof does the same for your task board: nothing lands in Vikunja until you've reviewed it.

It also works for editing existing projects, not just creating new ones — you describe the change, and it proposes a diff of tasks to add, edit, or remove.

I split it into two services instead of one:

The agent workflow runs on [Koog](https://github.com/JetBrains/koog), JetBrains' agent framework for Kotlin. The frontend is React + Vite. Everything ships as Docker Compose behind nginx with TLS.

It's provider-agnostic — Anthropic, OpenAI, Google, DeepSeek, or fully local via Ollama, in which case nothing leaves your

machine. Your Vikunja token and API keys live in

The interesting part wasn't "call an LLM and pars-before-apply** boundary.

It would have been easier to let the agent call t's basically what an MCP server does). But thenthe AI mutates your real data on every run, and you're left cleaning up after a confident-but-wrong decomposition.

Instead, the AI never touches Vikunja. It produces a *proposal*; the gateway holds it; you edit it; and only an explicit confirm

turns it into real API calls. The AI proposes, yo manages my actual projects, that tradeoff wasworth the extra plumbing.

This one I'm genuinely unsure about. When the description is vague, the agent pauses and asks a follow-up question instead of

guessing. It produces noticeably better task breaound-trip and some friction.

If you've built conversational/agentic tools: do t, or generating a best-effort result and lettingthe user correct it? I keep going back and forth.

It's early and rough in places, but it works end to end. Repo with setup instructions:

👉 [https://github.com/Yooshyasha/Trof](https://github.com/Yooshyasha/Trof)

Feedback welcome — especially on the review flow and whether the clarifying-questions step is worth the friction.
