cd/entity/Qdrant· home› entities› Qdrant

grep -l @qdrant /news/*.json | wc -l → 82

Qdrant

mentions 82 type Organization page 1/5 feed RSS

// recent coverage 82 mentions

18:05

2026-07-28

dev.to

developer-tools

Self-hosting ArangoDB: one database for graph, document, and vector search

A developer advocates for self-hosting ArangoDB, a multi-model database that natively supports graph, document, and vector search with a single query language (AQL), eliminating the need to duct-tape …

14:55

2026-07-27

discuss.huggingface.co

artificial-intelligence

Voice agent latency degrades after turn 7-8 despite fixed system prompt + limited history — looking for mitigation ideas beyond what we've already tried

Voice agent latency degrades after turn 7-8 in production, occasionally crossing Retell's ~3s reconnect threshold and dropping calls, despite using keep_alive: -1, KV cache reuse, and two-WebSocket ra…

11:47

2026-07-26

pub.towardsai.net

artificial-intelligence

Building a Self-Evaluating RAG Agent with LangGraph, Qdrant Hybrid Search & Phoenix

A new open-source RAG agent combines Qdrant hybrid search (dense + BM25 with RRF fusion), LangGraph ReAct routing with safety overrides, an MCP server for tool exposure, and Arize Phoenix for observab…

16:44

2026-07-24

dev.to

developer-tools

trelix v2.7 to v2.9: The Release Where the Pipeline Itself Became the Product

A developer shipped trelix v2.7.0 with a release pipeline bug that caused binary assets to collide, making it impossible to tell whether the surviving binary was macOS or Linux. Over six subsequent re…

15:18

2026-07-24

dev.to

artificial-intelligence

Qdrant vs Pinecone: Self-Hosted Vector Search for Production RAG

A developer compares Qdrant and Pinecone for production RAG systems, highlighting that Qdrant is an open-source vector search engine that can be self-hosted while Pinecone is a fully managed cloud ser…

11:33

2026-07-23

blog.stackademic.com

artificial-intelligence

Hybrid Retrieval Under the Microscope: BM25 vs MiniCOIL on MedQuAD

A controlled experiment comparing hybrid retrieval pipelines using BM25 versus miniCOIL on the MedQuAD dataset with EmbeddingGemma and Qdrant shows that miniCOIL, a sparse neural retrieval model, impr…

17:14

2026-07-22

dev.to

artificial-intelligence

Building Production-Ready RAG Applications: A Practical Guide

A developer's practical guide details the engineering challenges and solutions for deploying production-ready Retrieval-Augmented Generation (RAG) applications, covering data indexing, vector stores, …

08:44

2026-07-20

discuss.huggingface.co

artificial-intelligence

Real-time voice agents with local LLMs: the latency problem nobody fully solves

Real-time voice agents using local LLMs face a hard latency ceiling of ~3 seconds imposed by Retell AI's WebSocket timeout, which forces reconnections and call termination if the first token is not pr…

06:56

2026-07-19

github.com

artificial-intelligence

Visualizing how multimodal vector search works under the hood

A new open-source project demonstrates how multimodal vector search works under the hood, using OpenAI CLIP (ViT-B/32) to embed text and images into a 512-dimensional vector space and Qdrant (HNSW ind…

12:05

2026-07-18

dev.to

artificial-intelligence

Retrieval-Augmented Self-Recall — Part 2: Hybrid RAG on Nothing but Postgres

A developer built RE-call, a retrieval-augmented memory system for AI agents that runs entirely on PostgreSQL, using pgvector for dense vector search and built-in full-text search, fused via Reciproca…

12:05

2026-07-18

dev.to

large-language-models

Taking Over LLM Memory Store Testing with Pytest: 90% Fewer State Inconsistencies

A developer at an AI startup reduced memory state inconsistencies by 90% by replacing manual testing with a Pytest-based automated test suite for LLM memory stores. The new approach verifies consisten…

06:09

2026-07-17

pub.towardsai.net

ai-agents

Building AI Agents in Rust - part 8

Eugene v0.8 introduces memory for AI agents built in Rust, shipping two complementary stores, a VectorStore trait, and agentic RAG through new skills 'remember' and 'recall'. The memory system uses ma…

17:04

2026-07-16

dev.to

artificial-intelligence

The LLM Was the Easy Part: Building a Hybrid RAG API

A developer built a hybrid RAG API that combines dense and sparse retrieval with reciprocal rank fusion (RRF) to answer questions from PDFs. The system uses Qdrant for vector storage, a cross-encoder …

13:00

2026-07-16

dev.to

machine-learning

Vector Search — how HNSW finds nearest neighbours

HNSW (Hierarchical Navigable Small World) is a graph-based algorithm that powers vector search in FAISS, pgvector, Qdrant, Weaviate, and Milvus, enabling approximate nearest neighbor search in millise…

13:47

2026-07-14

dev.to

artificial-intelligence

Building a Robust RAG Pipeline Architecture for Production

A developer built a modular RAG pipeline architecture for production, using Docker containers on Cloud Run, with external configuration to swap components. The pipeline uses 500-character chunks with …

13:35

2026-07-12

dev.to

artificial-intelligence

Anatomy of a Full RAG Application: Every Concept, One Self-Hosted Stack

A developer built myRAG, a fully self-hosted RAG stack combining FastAPI, React, Qdrant, PostgreSQL, and Neo4j. The system uses hybrid search with dense and sparse embeddings, cross-encoder reranking,…

00:19

2026-07-11

dev.to

artificial-intelligence

Quantified Self 2.0: Stop Guessing Your Health History—Build a Personal Medical Vector Database

A developer built a personal health knowledge base using a vector database and RAG pipeline to organize scattered medical records. The system uses Qdrant for similarity search, Unstructured.io for par…

21:21

2026-07-10

dev.to

ai-agents

Mem0 vs TurboMem: which memory layer actually fits your TypeScript agent

A developer compares Mem0 and TurboMem as memory layers for TypeScript AI agents. TurboMem runs as a native TypeScript library inside the process, avoiding the separate service and infrastructure requ…

23:01

2026-07-09

pub.towardsai.net

ai-infrastructure

How Qdrant Reduced RAG Token Costs by 67% with Native ColBERT Reranking

Qdrant reduced retrieval-augmented generation token costs by 67% using native ColBERT reranking, which performs token-to-token matrix comparison inside the database in a single query call, eliminating…

16:00

2026-07-09

dev.to

machine-learning

How Vector Search Actually Works: IVF and HNSW

A developer explains how vector search works under the hood, focusing on the two dominant algorithms: IVF (Inverted File Index) and HNSW (Hierarchical Navigable Small World). The post details why appr…

page 1 / 5 next →

// co-occurs with top 8 entities

Pinecone 21 pgvector 20 OpenAI 18 Weaviate 15 FastAPI 12 Ollama 11 Milvus 10 HNSW 9

// topics top 6 topics

large language models 64 developer tools 51 ai infrastructure 50 artificial intelligence 49 machine learning 36 ai tools 33