cd /news/ai-tools/show-hn-scholar-sidekick-citation-ve… · home topics ai-tools article
[ARTICLE · art-19668] src=scholar-sidekick.com pub= topic=ai-tools verified=true sentiment=· neutral

Show HN: Scholar Sidekick – citation verifier for the "real DOI, wrong paper"

A clinician-educator built Scholar Sidekick, a citation verification tool that catches fabricated references where a real DOI points to the wrong paper. In testing 350 citations, the tool correctly identified all 37 fake references but wrongly flagged 5 of 285 real ones, a 1.8% false positive rate. The free web version and paid API aim to address a pattern of citation hallucination found in 1 in 277 PubMed Central articles.

read2 min publishedJun 2, 2026

One of the harder AI citation failures is quite simple: the identifier is real, but the citation is still fake. The DOI resolves, but to a different paper - not the paper the citation claims it is.

Topaz et al. reported their findings on citation hallucination in May in The Lancet. They scanned 2.5 million PubMed Central articles and estimated that 1 in 277 contained a fabricated citation. Some of their examples were this exact pattern: real identifier, fabricated title.

I originally built Scholar Sidekick as a formatter for my own use as a clinician-educator preparing talks, articles, etc. After reading the Topaz paper, I added a verifier to catch the most common pattern they found: a real identifier attached to the wrong paper.

My tool resolves the identifier, and then compares the title in your reference with the returned metadata (i.e. does this DOI, PMID, or arXiv ID actually point to the right paper?). It does not attempt to judge whether the cited paper actually supports the claim you make in your text. That still needs judgment, preferably human judgment.

I ran 350 previously unseen citations through the API once each in a test. It correctly identified all 37 fabricated references, but wrongly flagged 5 of 285 real references: 1.8% (95% CI 0.8–4.0%). (Plain similarity comparison, without the optional LLM screening - I would expect the LLM to rescue some of those borderline cases. A handful of citations returned no result on upstream timeouts and weren't scorable either way.) The test suite, results and failures are public, so you do not have to take my word for it. You can check them yourself.

The web version is free and anonymous. The REST API and MCP server use a RapidAPI key, with a free rate-limited tier and paid tiers above that. The MCP server is on npm, Smithery and Glama, and the Obsidian plugin is in the community store. Chrome/Firefox/Edge browser extensions in their stores as well.

I'm very open to feedback and look forward to hearing from anyone who tries it - what works? What fails? Thanks in advance.

Comments URL: [https://news.ycombinator.com/item?id=48377215](https://news.ycombinator.com/item?id=48377215)

Points: 1

── more in #ai-tools 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/show-hn-scholar-side…] indexed:0 read:2min 2026-06-02 ·