cd /news/large-language-models/rag-coding-enhancing-llm-medical-cod… · home topics large-language-models article
[ARTICLE · art-16056] src=arxiv.org pub= topic=large-language-models verified=true sentiment=↑ positive

RAG-Coding: Enhancing LLM Medical Coding with Structured External Knowledge

Researchers have developed RAG-Coding, a method that uses four large language model agents to automate ICD-10-CM medical coding by grounding decisions in official coding tabular lists and guidelines. On the MDACE dataset, the approach outperformed existing LLM-based baselines by 8-13% in micro-F1 and 2-8% in macro-F1, while also releasing an updated MDACE-2025 dataset with expert re-annotations aligned to 2025 clinical standards. The findings demonstrate that incorporating structured external knowledge significantly improves coding accuracy and clinical compliance over current automated methods.

read1 min publishedMay 28, 2026

arXiv:2605.27377v1 Announce Type: new Abstract: We present RAG-Coding, an agentic method for automated ICD-10-CM coding. RAG-Coding orchestrates four large language model (LLM) agents and grounds their coding decisions in external knowledge sources (e.g. the official coding tabular list and guidelines). By retrieving and cross-referencing relevant knowledge in these sources, the agents enhance coding accuracy and ensure clinical compliance. On the MDACE dataset, RAG-Coding outperforms the best LLM-based baseline by 8-13% in micro-F1 and 2-8% in macro-F1 across multiple LLM backbones. Compared to the state-of-the-art pretrained language model method, PLM-ICD, RAG-Coding exhibits higher micro recall (+11%), while PLM-ICD exhibits higher micro precision (+6%), yielding comparable micro- and macro-F1. Ablations show stepwise gains, highlighting the importance of incorporating external knowledge. We also release MDACE-2025, updating the original dataset with expert re-annotations with the latest 2025 ICD-10-CM guidelines. This update features more fine-grained code labels and enables evaluation against current clinical standards.

── more in #large-language-models 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/rag-coding-enhancing…] indexed:0 read:1min 2026-05-28 ·