{"slug": "jenkins-continues-development-of-ai-chatbot-for-resources", "title": "Jenkins Continues Development of AI Chatbot for Resources", "summary": "Mallikarjun G D and Daniele Caldarigi published Jenkins blog posts on May 26, 2026, detailing two GSoC 2026 projects extending the Jenkins ecosystem with AI chatbot plugins. G D's plugin adds an LLM-as-a-Judge evaluation pipeline using DeepEval metrics, a GraphRAG layer with NetworkX for plugin-dependency queries, and a Build Failure Diagnosis Agent that sanitizes logs with Presidio, while Caldarigi's plugin implements a React+Vite sidebar, FastAPI backend with LangGraph, ChromaDB vector store, and support for local Ollama or external API LLMs. These community-driven projects demonstrate practical integration of RAG, evaluation pipelines, and on-prem LLM options within a mature CI/CD tool, addressing enterprise needs for privacy, latency, and reproducibility.", "body_md": "# Jenkins Continues Development of AI Chatbot for Resources\n\nMallikarjun G D's Jenkins blog post (May 26, 2026) reports a GSoC 2026 continuation of an AI chatbot plugin embedded in the Jenkins UI, extending the project with three core features: an **LLM-as-a-Judge** evaluation pipeline using a curated golden dataset and DeepEval metrics, a **GraphRAG** layer implemented with NetworkX for plugin-dependency queries, and a Build Failure Diagnosis Agent that strips PII with Presidio before passing sanitized logs to the LLM. Daniele Caldarigi's Jenkins blog post (May 26, 2026) describes a complementary GSoC plugin focused on guiding user workflow, with a React+Vite sidebar, a Jenkins Controller, a FastAPI backend using LangGraph, ChromaDB for vectors, and a choice of a local LLM via Ollama or an external API. Industry context: these posts show community-driven experimentation with RAG, evaluation pipelines, and on-prem/local LLM options within a mature CI/CD tool.\n\n### What happened\n\nMallikarjun G D's Jenkins blog post (May 26, 2026) documents a GSoC 2026 continuation of an AI chatbot plugin embedded in the **Jenkins** UI, with three stated feature areas: an **LLM-as-a-Judge** evaluation pipeline using a curated golden dataset and **DeepEval** metrics, a **GraphRAG** layer built with **NetworkX** to traverse plugin dependency relationships, and a Build Failure Diagnosis Agent that sanitizes logs with Presidio before sending context to an LLM. Daniele Caldarigi's Jenkins blog post (May 26, 2026) outlines a related GSoC plugin to guide user workflows, describing a frontend implemented with **React+Vite**, a Jenkins Controller, a **FastAPI** backend, LangGraph for agent reasoning, ChromaDB as the vector store, and a configurable LLM hosted locally with Ollama or via an external API.\n\n### Technical details\n\nEditorial analysis - technical context: The combination of a judge-style evaluation pipeline, explicit GraphRAG for dependency-aware retrieval, and a log-diagnosis agent reflects three complementary technical risks and benefits practitioners track when embedding LLMs into developer tooling. Using an evaluation model and **DeepEval** metrics helps create repeatable benchmarks for retrieval and answer quality, which is important for avoiding regressions as embeddings, prompt templates, and retrieval strategies change. Graph traversal with **NetworkX** is a practical approach for dependency queries, but it raises operational questions around graph size, update cadence, and real-time traversal cost. Integrating Presidio for PII stripping demonstrates an attention to data hygiene; practitioners will want to validate redaction effectiveness across varied build logs and formats.\n\n### Context and significance\n\nIndustry context: Community-driven projects in major engineering tools increasingly combine RAG, local LLM hosting, and evaluation pipelines to balance privacy, latency, and cost. The modular architecture described in Daniele's post - separating frontend, a controller for auth, and a FastAPI backend - mirrors common patterns that let operators choose where to host ChromaDB and their LLM. For open-source CI/CD ecosystems, these choices matter because they affect deployability in air-gapped or enterprise environments and influence maintenance burden for plugin authors.\n\n### What to watch\n\n- •Evaluation: which judge model and\n**DeepEval** metrics the contributors settle on and whether runs are reproducible across hardware. - •GraphRAG scale: how the NetworkX graph is populated and updated as plugin metadata evolves.\n- •Data governance: effectiveness of Presidio redaction and policies for indexing external forums (Discourse, Reddit).\n- •LLM hosting trade-offs: adoption of local Ollama-hosted models versus third-party APIs and the operational implications for latency and cost.\n\n## Scoring Rationale\n\nThis is a notable open-source engineering effort showing practical integration patterns (GraphRAG, evaluation pipelines, PII stripping) relevant to practitioners embedding LLMs in developer tools, but it is not a frontier model or industry-shaking release.\n\nPractice with real FinTech & Trading data\n\n90 SQL & Python problems · 15 industry datasets\n\n[Active Verified Users by Income TierEasy](/problems/sql/active-verified-users-by-income)\n\n[Technology Stocks with High BetaMedium](/problems/sql/technology-stocks-with-high-beta)\n\n[Portfolio Performance ScorecardHard](/problems/sql/portfolio-performance-scorecard)\n\n250 free problems · No credit card\n\n[See all FinTech & Trading problems](/problems/datasets/fintech)", "url": "https://wpnews.pro/news/jenkins-continues-development-of-ai-chatbot-for-resources", "canonical_source": "https://letsdatascience.com/news/jenkins-continues-development-of-ai-chatbot-for-resources-2674fe2b", "published_at": "2026-05-26 21:49:48.868783+00:00", "updated_at": "2026-05-26 21:49:52.024267+00:00", "lang": "en", "topics": ["artificial-intelligence", "large-language-models", "ai-tools", "ai-agents", "mlops"], "entities": ["Jenkins", "Mallikarjun G D", "Daniele Caldarigi", "DeepEval", "NetworkX", "Presidio", "LangGraph", "ChromaDB"], "alternates": {"html": "https://wpnews.pro/news/jenkins-continues-development-of-ai-chatbot-for-resources", "markdown": "https://wpnews.pro/news/jenkins-continues-development-of-ai-chatbot-for-resources.md", "text": "https://wpnews.pro/news/jenkins-continues-development-of-ai-chatbot-for-resources.txt", "jsonld": "https://wpnews.pro/news/jenkins-continues-development-of-ai-chatbot-for-resources.jsonld"}}