Why AI can't be trusted to write scientific reviews

wpnews.pro

cd /news/artificial-intelligence/why-ai-can-t-be-trusted-to-write-sci… · home › topics › artificial-intelligence › article

[ARTICLE · art-19053] src=nature.com ↗ pub=2026-05-31T09:06Z topic=artificial-intelligence verified=true sentiment=↓ negative

Why AI can't be trusted to write scientific reviews

The Cochrane Collaboration's editor-in-chief warns that current AI tools are not ready to replace humans in conducting systematic reviews of scientific literature, citing risks of errors that could mislead patients or waste healthcare resources. Testing at the London-based publisher found that AI models require lengthy training, often take longer than manual work, and rely on opaque proprietary systems that may introduce bias, particularly when evaluating drugs and medical devices. The editor argues that developers should shift from using AI to generate individual reviews toward building systems that enable effective human-AI collaboration.

read2 min views17 publishedMay 31, 2026

Artificial-intelligence tools are being touted as a means to conduct rapid reviews of scientific literature. At the London-based publisher the Cochrane Collaboration, where I became editor-in-chief in March, we specialize in health-related systematic reviews: the highest-quality syntheses of all the available research. We are testing ways to use AI to increase our reviews’ efficiency and scale. But, in our experience, the current tools are far from ready for mainstream adoption, and the assumption that machines can replace humans on all methodological tasks is flawed.

The stakes are high. Systematic reviews and other types of evidence synthesis inform clinical practice, public-health guidance and policy decisions that affect entire populations. Errors could give false hope to patients or lead health systems to waste money on ineffective or unsafe interventions.

Will AI speed up literature reviews or derail them entirely?

Current AI models typically replicate the step-by-step processes by which people conduct systematic reviews: identifying suitable studies from disparate sources; extracting relevant data for analysis; and, finally, writing up the report. The idea is to replace the work of humans.

But conducting systematic reviews is not a purely computational task. Human specialists are needed to define meaningful review questions, evaluate relevance, interpret results and understand clinical or policy implications. Context and subjective nuance are seldom well-represented in AI models’ training data, and the models’ tendency to hallucinate — that is, to fabricate information — means that their outputs need to be verified by human experts.

Efforts at Cochrane show the limitations of using AI in place of people. We’ve been evaluating tools that support study screening and data extraction. These are time-consuming processes to conduct manually, particularly when primary data are not readily accessible and must be drawn from multiple sources or inferred from published results.

But we’ve found that most of the tools available were developed by private companies. This is problematic for reviews that evaluate drugs and medical devices, because these need to be independent of industry. What’s more, few AI models are open source, with most relying on opaque, proprietary ‘black box’ processes. This means there’s no way to examine whether a tool might disproportionately include trials with results favourable to one drug company.

Scientists are building giant ‘evidence banks’ to create policies that actually work

And, on the practical side, tools in the current generation require long training periods for both the AI and the human operator before they yield reliable results. So far, we’ve found that, for each review, the whole process takes longer than doing the work manually.

In my view, to realize the full potential of AI, it’s crucial that tool developers, authors and evidence users move away from using it to generate individual reviews. Instead of mimicking human processes, developers should start building systems that allow humans and AI to work together effectively to assess studies.

source & further reading

nature.com — original article 'Humanizer' tool can erase signs of AI-written text Nobel-winning chemist leaves US to direct AI materials lab in China Large language models can predict the results of social science experiments

~/api · this article 200

$curl api.wpnews.pro/v1/news/why-ai-can-t-be-trusted-…

Read original on nature.com → www.nature.com/articles/d41586-026-01616-3

mentioned entities

Cochrane Collaboration

metadata

slugwhy-ai-can-t-be-trusted-to-write-scientific-reviews

topic#artificial-intelligence

secondary4 topics

sentimentnegative

canonicalnature.com

navigation

← prevWhy Chinese AI labs went open an…

next →#DAY1: The story of unemployment…

── more in #artificial-intelligence 4 stories · sorted by recency

dev.to · 16 Jul · #artificial-intelligence

Stratagems #15: Derek and Alex Shared One Server. ACL's AI Was Listening to Both.

insertchaos.bearblog.dev · 15 Jul · #artificial-intelligence

Will there be any more bugs to find after AI has fixed them all?

pub.towardsai.net · 16 Jul · #artificial-intelligence

I Built a Hybrid RAG App That Talks to My PDF — and Knows When to Say “I Don’t Know”

lesswrong.com · 15 Jul · #artificial-intelligence

Recap of bike trip/street interviews across America

── more on @cochrane collaboration 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 8 Jul · #ai-chips

D-Matrix launches Corsair AI inference platform, challenging Nvidia’s GPU dominance

wpnews · 8 Jul · #artificial-intelligence

What Is Vibe Coding? How AI Builds Games From Scratch

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required