{"slug": "how-we-generate-300-ai-business-ideas-a-month-with-gpt-5-and-filter-the-junk-out", "title": "How We Generate 300+ AI Business Ideas a Month With GPT-5 (and Filter the Junk Out)", "summary": "A developer built a pipeline that generates over 300 AI business ideas per month for the AI Student Factory platform, using GPT-5 to populate an idea library. The system employs a validation gate to filter out low-quality suggestions, ensuring every stored idea represents a viable six-month project for an aspiring founder. The pipeline stores ideas in a single Postgres table with provenance tracking, and the developer cut trend-scraping inputs and model ensembling after they degraded output quality.", "body_md": "Six months ago I shipped an AI quiz that matches aspiring founders to a business idea they can actually build. The matcher only works if the underlying idea library is large, fresh, and not full of slop. So I had to build the pipeline that fills it.\n\nThis post walks through the real architecture: prompt design, the validation gate, the day it silently produced zero ideas for 48 hours, and what we'd cut if we started over. If you're building anything that uses LLMs to generate structured content at scale, you'll probably hit the same walls.\n\nThe product this powers is [AI Student Factory](https://aistudentfactory.com) — but the pipeline is generic.\n\nThe matcher quiz returns a single idea per user. If that idea is bad, the entire product is bad. So the bar wasn't \"generate lots of ideas\" — it was **\"every idea in the library has to be someone's reasonable next 6 months.\"**\n\nThat meant each row needed:\n\nLLMs are bad at all of these by default. They will gleefully invent a keyword volume of \"8,400 searches/month\" for a phrase no human has ever Googled.\n\nThat's it. Everything else gets accepted and surfaced; the matcher quiz handles ranking at retrieval time, not at insertion time. Loose at ingest, strict at query — same pattern as a search engine.\n\nA single `ideas`\n\ntable in Postgres, with:\n\n`jsonb`\n\ncolumn for the build steps and tags`published`\n\nboolean for soft-gating`summary`\n\ncolumn with a length check enforced at the DB level (not in app code — DB is the last honest layer)Don't put the build plan in 6 separate columns. You will regret it the moment you want to support a 4-step idea.\n\n**Cut: the \"trend signal\" inputs.** I scraped Reddit/HN/PH for trending topics and fed them in as inspiration. The output got worse, not better — the model latched onto whatever was in the seed and ignored its training. Now I just give it a niche category and let it cook.\n\n**Cut: model ensembling.** I tried running each idea through two models and merging. It was 2× the cost for a noise-level quality improvement. Pick one good model and trust it.\n\n**Keep: per-row provenance.** Every idea stores the prompt version, model, and timestamp that generated it. When quality drifts, I can diff prompt versions against acceptance rate. This caught a regression three weeks ago that would otherwise have been invisible.\n\nMost of what made this work was operational, not clever:\n\nThe full product — the quiz that uses this library to match people to ideas — is at [aistudentfactory.com](https://aistudentfactory.com). Happy to dig into any specific piece in the comments.\n\n*If you're building something similar and want to compare notes, my email is in the footer of the site. I read everything.*", "url": "https://wpnews.pro/news/how-we-generate-300-ai-business-ideas-a-month-with-gpt-5-and-filter-the-junk-out", "canonical_source": "https://dev.to/erfan_khoshnazar_5d51a5f7/how-we-generate-300-ai-business-ideas-a-month-with-gpt-5-and-filter-the-junk-out-4iml", "published_at": "2026-05-30 19:44:42+00:00", "updated_at": "2026-05-30 20:11:52.053477+00:00", "lang": "en", "topics": ["ai-startups", "ai-products", "ai-tools", "large-language-models", "generative-ai"], "entities": ["GPT-5", "AI Student Factory"], "alternates": {"html": "https://wpnews.pro/news/how-we-generate-300-ai-business-ideas-a-month-with-gpt-5-and-filter-the-junk-out", "markdown": "https://wpnews.pro/news/how-we-generate-300-ai-business-ideas-a-month-with-gpt-5-and-filter-the-junk-out.md", "text": "https://wpnews.pro/news/how-we-generate-300-ai-business-ideas-a-month-with-gpt-5-and-filter-the-junk-out.txt", "jsonld": "https://wpnews.pro/news/how-we-generate-300-ai-business-ideas-a-month-with-gpt-5-and-filter-the-junk-out.jsonld"}}