RAG-Coding: Enhancing LLM Medical Coding with Structured External Knowledge

wpnews.pro

cd /news/large-language-models/rag-coding-enhancing-llm-medical-cod… · home › topics › large-language-models › article

[ARTICLE · art-16056] src=arxiv.org ↗ pub=2026-05-28T04:00Z topic=large-language-models verified=true sentiment=↑ positive

RAG-Coding: Enhancing LLM Medical Coding with Structured External Knowledge

Researchers have developed RAG-Coding, a method that uses four large language model agents to automate ICD-10-CM medical coding by grounding decisions in official coding tabular lists and guidelines. On the MDACE dataset, the approach outperformed existing LLM-based baselines by 8-13% in micro-F1 and 2-8% in macro-F1, while also releasing an updated MDACE-2025 dataset with expert re-annotations aligned to 2025 clinical standards. The findings demonstrate that incorporating structured external knowledge significantly improves coding accuracy and clinical compliance over current automated methods.

read1 min views9 publishedMay 28, 2026

arXiv:2605.27377v1 Announce Type: new Abstract: We present RAG-Coding, an agentic method for automated ICD-10-CM coding. RAG-Coding orchestrates four large language model (LLM) agents and grounds their coding decisions in external knowledge sources (e.g. the official coding tabular list and guidelines). By retrieving and cross-referencing relevant knowledge in these sources, the agents enhance coding accuracy and ensure clinical compliance. On the MDACE dataset, RAG-Coding outperforms the best LLM-based baseline by 8-13% in micro-F1 and 2-8% in macro-F1 across multiple LLM backbones. Compared to the state-of-the-art pretrained language model method, PLM-ICD, RAG-Coding exhibits higher micro recall (+11%), while PLM-ICD exhibits higher micro precision (+6%), yielding comparable micro- and macro-F1. Ablations show stepwise gains, highlighting the importance of incorporating external knowledge. We also release MDACE-2025, updating the original dataset with expert re-annotations with the latest 2025 ICD-10-CM guidelines. This update features more fine-grained code labels and enables evaluation against current clinical standards.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/rag-coding-enhancing-llm…

Read original on arxiv.org → arxiv.org/abs/2605.27377

mentioned entities

RAG-Coding

ICD-10-CM

MDACE

PLM-ICD

MDACE-2025

metadata

slugrag-coding-enhancing-llm-medical-coding-with-structured-external-knowledge

topic#large-language-models

secondary4 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevOpen House 2026 Day 1: real-time…

next →New poll points to possible Bece…

── more in #large-language-models 4 stories · sorted by recency

dev.to · 15 Jul · #large-language-models

Why did my benchmark stop at N=22? A debugging story in nine bugs

wired.com · 15 Jul · #large-language-models

AI Isn’t Smarter Than a Baby—Yet

dev.to · 15 Jul · #large-language-models

My benchmark's Python column was N/A for a year — CPython's 4300-digit limit, and eight other bugs

networkworld.com · 15 Jul · #large-language-models

IBM targets AI edge with Power server, software upgrades

── more on @rag-coding 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 23 May · #artificial-intelligence

AccessLens — a blind person's lanyard, powered by Gemma 4 on-device

wpnews · 21 May · #developer-tools

Antigravity CLI: A Hands-On Guide to Google's Terminal Coding Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required