Agentic AI for the EU AI Act 🇪🇺
A developer built an open-source AI assistant to help users navigate the EU AI Act, which becomes enforceable on August 2, 2026. The system uses a two-agent CrewAI architecture with Qdrant for vector …
Metadata filtering before vector search is a cheap but often overlooked recall win. By applying a hard predicate on metadata like customer_id, the search space shrinks from millions to hundreds of chu…
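A minimal sketch of the pre-filtering idea, assuming an in-memory list of chunks rather than a real vector database. The chunk layout, the `filtered_search` helper, and the sample vectors are all hypothetical and for illustration only; a production system would push the predicate down into the vector store's own filter mechanism.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def filtered_search(chunks, query_vec, customer_id, top_k=3):
    """Apply a hard metadata predicate first, then rank only the survivors."""
    # Step 1: exact-match filter on metadata shrinks the candidate set.
    candidates = [c for c in chunks if c["customer_id"] == customer_id]
    # Step 2: similarity ranking runs over the reduced set only.
    ranked = sorted(
        candidates,
        key=lambda c: cosine(c["vector"], query_vec),
        reverse=True,
    )
    return ranked[:top_k]

# Hypothetical toy data: two customers, 2-dimensional embeddings.
chunks = [
    {"id": "a1", "customer_id": "acme", "vector": [1.0, 0.0]},
    {"id": "a2", "customer_id": "acme", "vector": [0.6, 0.8]},
    {"id": "b1", "customer_id": "globex", "vector": [1.0, 0.0]},
]

results = filtered_search(chunks, [1.0, 0.0], "acme", top_k=2)
result_ids = [c["id"] for c in results]
print(result_ids)
```

Chunk `b1` matches the query vector exactly but belongs to another customer, so the predicate removes it before any similarity score is computed.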
A developer built a Quantified-Self RAG (Retrieval-Augmented Generation) system that ingests a decade of personal medical records from messy PDF scans using Unstructured.io, Sentence-Transformers, and…
A RAG pipeline that fails to model patient identity before retrieval cannot deliver clinically relevant answers, according to a system built at a Global AI Hackathon. HPPIE (Hyper-Personalized Patient…
A developer built a RAG system for financial document Q&A that improved accuracy from 10% to 57% on the FinanceBench benchmark, validated against 150 expert-annotated question-answer pairs from SEC fi…
Developer ClaudioDrews released Memory OS, an open-source six-layer memory stack for the Hermes Agent, under an MIT license. The system adds a vector database, structured facts, and an auto-curated kn…
A Rust-based AI agent operating system called Gliding Horse has been released, featuring a CPU-like hierarchical memory architecture and PDCA cycle orchestration for coordinating multiple agents. The …
A developer benchmarked ChromaDB, Qdrant, Weaviate, and pgvector for RAG pipelines, finding that most teams over-optimize for future scale while underestimating day-one operational costs. ChromaDB off…
A developer built the Closiq Discord Agent, a full-stack modular monolith that transforms a Discord channel into an automated AI customer support and lead management system. The system captures conver…
A new empirical study found that layering three retrieval methods—typed discovery, semantic context, and file verification—achieved a 0.954 score for LLM-generated engineering artifacts, outperforming…
A developer built a fully local Retrieval-Augmented Generation (RAG) system using Ollama and TypeScript, requiring no API keys or third-party calls. The 200-line command-line tool indexes `.md` and `.…
HIVE is a decentralized peer-to-peer knowledge base designed as a "Wikipedia for machines" that provides cryptographically verifiable content for LLMs using the Holepunch stack. It uses a two-tier archi…
"RAG Readiness" is a tool designed to eliminate the vague "it depends" answers that plague Retrieval-Augmented Generation (RAG) architecture decisions. Instead of providing comparison tables, the tool u…
This article is a transcript from the Stack Overflow podcast featuring Brian O'Grady from Qdrant, who discusses the differences between vector databases and Lucene-based architectures like Elasticsear…