RAS: Reflection-Augmented Scaling with In-Context Learning for Executable Cypher Query Generation

wpnews.pro

cd /news/large-language-models/ras-reflection-augmented-scaling-wit… · home › topics › large-language-models › article

[ARTICLE · art-13632] src=arxiv.org ↗ pub=2026-05-25T04:00Z topic=large-language-models verified=true sentiment=↑ positive

RAS: Reflection-Augmented Scaling with In-Context Learning for Executable Cypher Query Generation

Researchers introduced Reflection-Augmented Scaling (RAS), a method that uses execution error messages from databases to improve Cypher query generation through in-context learning. Testing across three Neo4j datasets and five language models, RAS reduced query execution error rates by 41–50% at five attempts, outperforming the 32–38% reduction achieved by independent resampling. The findings demonstrate that database-generated error messages serve as actionable feedback, making inference-time compute more efficient for producing executable queries than scaling independent samples.

read1 min views13 publishedMay 25, 2026

arXiv:2605.22937v1 Announce Type: new Abstract: Inference-time scaling can reduce errors in structured query generation, but methods to allocate the compute for query code generation remains underexplored. We study Text2Cypher, where language models generate Cypher queries that execute against property graph databases. Non-executable queries constitute a distinct syntactic failure separate from semantic inaccuracy: a syntax error triggers a system-generated error message from the database. These error messages are typically discarded at inference time rather than leveraged through in-context learning (ICL). We compare two inference methods: Independent Scaling (IS), which performs memoryless resampling, and Reflection-Augmented Scaling (RAS), which conditions each new attempt on prior execution feedback via ICL. Across three Neo4j datasets and five code-specialized language models, RAS reduces the Query Execution Error Rate by 41--50% at n{=}5, outperforming IS at 32--38%. Execution errors are not merely failures to discard but actionable feedback, and structuring inference-time compute around them is a more efficient path to executability than scaling independent samples.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/ras-reflection-augmented…

Read original on arxiv.org → arxiv.org/abs/2605.22937

mentioned entities

Neo4j

RAS

Text2Cypher

Cypher

metadata

slugras-reflection-augmented-scaling-with-in-context-learning-for-executable-cypher

topic#large-language-models

secondary4 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevThe Eternal Sloptember

next →Samsung memory workers call off …

── more in #large-language-models 4 stories · sorted by recency

neo4j.com · 22 Jun · #large-language-models

Zero-Copy Graph Reasoning on Snowflake: Getting Started With Neo4j Virtual Graph

gpuquicklist.com · 9 Jul · #large-language-models

GPU Quicklist – AI PC/Mac

thenextweb.com · 9 Jul · #large-language-models

A four-year-old has seen more of the world than ChatGPT. Yann LeCun is betting $1bn on that

dev.to · 9 Jul · #large-language-models

How Vector Search Actually Works: IVF and HNSW

── more on @neo4j 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 8 Jul · #artificial-intelligence

Anthropic's "J-lens" reveals workspace in Claude mirrors theory of consciousness

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required