Query-Adaptive Semantic Chunking for Retrieval-Augmented Generation: A Dynamic Strategy with Contextual Window Expansion

wpnews.pro

cd /news/artificial-intelligence/query-adaptive-semantic-chunking-for… · home › topics › artificial-intelligence › article

[ARTICLE · art-13629] src=arxiv.org ↗ pub=2026-05-25T04:00Z topic=artificial-intelligence verified=true sentiment=↑ positive

Query-Adaptive Semantic Chunking for Retrieval-Augmented Generation: A Dynamic Strategy with Contextual Window Expansion

Researchers have developed Query-Adaptive Semantic Chunking (QASC), a method that integrates user queries into the document segmentation process for Retrieval-Augmented Generation systems. In tests on 100 technical documents with 200 queries, QASC achieved an F1-score of 0.85, outperforming fixed chunking by 18-27% and semantic or agentic methods by 8-12%. The approach uses cosine similarity scoring, contextual window expansion, and chunk-level score aggregation to dynamically construct chunks that preserve coherence and relevance.

read1 min views10 publishedMay 25, 2026

arXiv:2605.22834v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems depend critically on document chunking quality for retrieving relevant context. Fixed chunking segments documents into uniform units irrespective of semantics or user intent, producing a precision-recall trade-off unresolvable by tuning chunk size alone. Semantic and agentic methods partially address these limitations but do not integrate user queries at the chunking stage. We present Query-Adaptive Semantic Chunking (QASC), which dynamically constructs chunks by integrating queries into segmentation through three mechanisms: cosine similarity scoring between sentence and query embeddings to identify seed sentences, contextual window expansion around seeds to preserve coherence, and chunk-level score aggregation to ensure holistic relevance. We evaluate QASC on 100 technical documents across 200 queries spanning four types, comparing against fixed chunking at five granularities, recursive splitting, semantic chunking, and agentic chunking. QASC achieves an F1-score of 0.85, a relative improvement of 18-27% over fixed chunking and 8-12% over semantic and agentic alternatives. Ablation studies confirm each component contributes meaningfully. Human evaluation by three annotators (Cohen kappa = 0.82) corroborates that QASC produces more relevant and coherent chunks than existing methods.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/query-adaptive-semantic-…

Read original on arxiv.org → arxiv.org/abs/2605.22834

mentioned entities

QASC

Retrieval-Augmented Generation

RAG

metadata

slugquery-adaptive-semantic-chunking-for-retrieval-augmented-generation-a-dynamic

topic#artificial-intelligence

secondary4 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevThe Eternal Sloptember

next →Samsung memory workers call off …

── more in #artificial-intelligence 4 stories · sorted by recency

machinebrief.com · 10 Jul · #artificial-intelligence

Cancer: Can AI Models Truly Predict Tumor Stages?

eventual.ai · 10 Jul · #artificial-intelligence

Querying Physical AI Data with Daft

cryptobriefing.com · 10 Jul · #artificial-intelligence

Google’s new quantum calibration breakthrough brings post-quantum crypto timeline into sharper focus

machinebrief.com · 10 Jul · #artificial-intelligence

AI Model Calibration with NDVM

── more on @qasc 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 8 Jul · #artificial-intelligence

AI Tokenomics: How to tokenmin while ROImaxxing

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required