cd /news/large-language-models/detecting-hallucinations-for-large-l… · home topics large-language-models article
[ARTICLE · art-33540] src=arxiv.org ↗ pub= topic=large-language-models verified=true sentiment=↑ positive

Detecting Hallucinations for Large Language Model-based Knowledge Graph Reasoning

Researchers proposed LUCID, the first hallucination detection method for large language model-based knowledge graph reasoning, which jointly leverages LLM attention scores, KG semantics, and structural information. Experiments on nine datasets show LUCID achieves state-of-the-art performance compared to 15 baselines, addressing a critical gap in detecting hallucinations that cause misinformation in KG reasoning tasks.

read1 min views1 publishedJun 19, 2026

arXiv:2606.19351v1 Announce Type: new Abstract: Knowledge graph (KG) reasoning infers new knowledge from existing facts and is widely applied in question answering, recommendation, and decision support. With the rapid development of large language models (LLMs), LLM-based KG reasoning frameworks have become increasingly popular by leveraging retrieved KG information. However, hallucinations in LLMs remain a critical issue. Even when relevant KG knowledge is incorporated, models may still generate incorrect outputs, leading to misinformation and unreliable decisions. Existing hallucination detection methods either focus on LLM internal states or verify consistency with retrieved contexts, but both overlook the structural information in KGs, resulting in suboptimal performance. To address this gap, we propose LUCID, the first halLUcination deteCtIon method for LLM-based knowleDge graph reasoning frameworks. LUCID jointly leverages LLM attention scores, KG semantics, and structural information. Specifically, it extracts node and edge features from attention scores and semantic similarities, and integrates them with KG structure using a graph neural network. We also construct manually annotated benchmark datasets for evaluation. Experiments on nine datasets show that LUCID achieves state of the art performance compared to 15 baselines.

── more in #large-language-models 4 stories · sorted by recency
── more on @lucid 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/detecting-hallucinat…] indexed:0 read:1min 2026-06-19 ·