SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings

Source: arxiv.org · Published 2026-06-12T04:00Z · Topic: large-language-models

In a preprint posted to arXiv, researchers have proposed SafeLLM, an extraction-based method for retrieval-augmented generation that resists hallucination by selecting relevant sentences directly from source documents rather than rewriting them. In tests on NHS and NICE clinical guidelines, line-number-based source selection achieved up to 95% term recall while staying closely aligned with the original text, outperforming both direct copying and safety-focused strategies. Safety-oriented strategies improved precision but introduced systematic omissions, and alternative strategies did better on more verbose documents. The findings suggest that extraction-based approaches offer a more reliable way to deploy LLMs in safety-critical settings such as healthcare and compliance, where factual accuracy is paramount.

arXiv:2606.12897v1

Abstract: Large language models (LLMs) are increasingly used to access organisational documentation, including standard operating procedures (SOPs), HR policies and institutional guidelines. However, retrieval-augmented generation (RAG) systems that rely on free-form rewriting can introduce hallucinations and unstable trade-offs between completeness and conciseness, particularly in safety- and compliance-critical settings.

Objectives: To evaluate extraction as a hallucination-resistant alternative to rewriting-based RAG and compare strategies that balance precision, recall and safety across document types and model scales.

Methods: We compare multiple prompting strategies, including line-number-based source selection, extraction of relevant guideline sentences with explicit safety annotations, and a multi-stage pipeline that refines draft answers using supporting evidence from source guidelines. Experiments are conducted on documents of varying length and structure, including local NHS acute care and oncology guidelines and UK-wide NICE guidelines, using both frontier-scale and locally deployable models. Performance is assessed using automatic metrics and human expert evaluation of relevance and completeness.

Results: Line-number selection achieves the strongest results, outperforming direct copying and safety-focused strategies across both large and small models while maintaining high term recall (up to 95%) and close alignment with source text. Safety-oriented approaches improve precision but introduce systematic omissions, while multi-stage filtering further amplifies this trade-off. Performance varies with document structure: line-based extraction excels in protocol-like content, whereas alternative strategies perform better on more verbose documents (up to 97% term recall).
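To make the idea of line-number-based source selection concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not give their prompts, parsing rules, or the exact definition of term recall, so every function name, the prompt wording, the `2, 5-7` reply format, and the substring-based recall measure below are assumptions made for illustration. The model call itself is left out, and the sample guideline text is invented, not clinical guidance.

```python
import re


def number_lines(document: str) -> list[str]:
    """Split a document into non-empty lines, the units a model may select."""
    return [line.strip() for line in document.splitlines() if line.strip()]


def build_prompt(question: str, lines: list[str]) -> str:
    """Show the model numbered lines and ask only for line numbers (assumed wording)."""
    numbered = "\n".join(f"[{i}] {text}" for i, text in enumerate(lines, start=1))
    return (
        "List only the numbers of the lines that answer the question, "
        "for example '2, 5-7'. Do not rewrite any text.\n\n"
        f"Question: {question}\n\nGuideline:\n{numbered}"
    )


def parse_selection(reply: str, n_lines: int) -> list[int]:
    """Parse a reply such as '2, 5-7' into sorted, unique, in-range line numbers."""
    selected: set[int] = set()
    for start, end in re.findall(r"(\d+)(?:\s*-\s*(\d+))?", reply):
        lo, hi = int(start), int(end or start)
        # Clamp to the document so out-of-range numbers are silently dropped.
        selected.update(range(max(lo, 1), min(hi, n_lines) + 1))
    return sorted(selected)


def extract_lines(lines: list[str], selection: list[int]) -> list[str]:
    """Return the selected source lines verbatim; nothing is paraphrased."""
    return [lines[n - 1] for n in selection]


def term_recall(answer: str, reference_terms: list[str]) -> float:
    """Fraction of reference terms found in the answer (assumed definition)."""
    if not reference_terms:
        raise ValueError("reference_terms must not be empty")
    text = answer.lower()
    hits = sum(1 for term in reference_terms if term.lower() in text)
    return hits / len(reference_terms)


# Invented example text, standing in for a protocol-like guideline.
guideline = """Check the airway.

Give oxygen if saturation is low.
Call the senior clinician.
Document the time."""

lines = number_lines(guideline)
prompt = build_prompt("What should be done after the airway check?", lines)
model_reply = "2, 3-4, 99"  # stand-in for a real model response
answer_lines = extract_lines(lines, parse_selection(model_reply, len(lines)))
print(answer_lines)
print(term_recall(" ".join(answer_lines), ["oxygen", "senior clinician", "airway"]))
```

Because the answer is assembled from the original lines by index, the model can choose the wrong lines but cannot alter their wording, which is the property the abstract describes as close alignment with source text. The trade-off shows up as omission instead: in this example the unselected first line lowers the recall score.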

source & further reading

Read original on arxiv.org → arxiv.org/abs/2606.12897
