Build an agentic AI healthcare claims pipeline with Amazon Bedrock and AWS HealthLake AWS announced a new solution for automating healthcare claims processing using Amazon Bedrock and AWS HealthLake. The pipeline uses Bedrock Data Automation to extract data from CMS-1500 forms and an AI agent on Bedrock AgentCore to validate and create FHIR resources in HealthLake, reducing manual processing time. Artificial Intelligence https://aws.amazon.com/blogs/machine-learning/ Build an agentic AI healthcare claims pipeline with Amazon Bedrock and AWS HealthLake Manually processing paper-based forms remains a significant cost in the healthcare industry. Despite advancements in data extraction of scanned documents and images, human oversight is usually still needed. Entry error by the individual creating the form or lower-confidence extractions from the digitization still must be remediated. In this post, we show you how to build an automated claims processing pipeline using two key Amazon Bedrock capabilities: Amazon Bedrock Data Automation https://aws.amazon.com/bedrock/bda/ for intelligent document extraction from healthcare claim forms, and Amazon Bedrock AgentCore https://aws.amazon.com/bedrock/agentcore/ for hosting an AI agent that validates and transforms the extracted data into FHIR Fast Healthcare Interoperable Resources resources in AWS HealthLake https://aws.amazon.com/healthlake/ . You will learn how to combine these services to create an end-to-end workflow that reduces manual processing while maintaining accuracy through automated validation checks. Solution overview The solution demonstrates an automated workflow for processing healthcare claim forms using AI-powered services. When a healthcare provider uploads a CMS-1500 claim form in PDF format to an Amazon Simple Storage Service Amazon S3 bucket, it triggers a processing pipeline starting with AWS Lambda https://aws.amazon.com/pm/lambda/ that performs three main functions: - Amazon Bedrock Data Automation extracts structured data from the form using intelligent document processing. - An AI agent using Strands Agents https://strandsagents.com/ running on Amazon Bedrock AgentCore validates this data against existing patient and provider records in AWS HealthLake, checking for completeness and consistency. - If all validations pass, the agent creates a standardized FHIR claim resource in HealthLake. It also generates a technical summary for claims processors and a patient-friendly explanation of the claim status. Both go out as Amazon Simple Notification Service Amazon SNS notifications https://aws.amazon.com/sns/ . This automated workflow helps reduce manual processing time while maintaining accuracy through AI-assisted validation. Figure 1: An architectural view of the solution. The preceding diagram illustrates the following steps: - A submitter uploads a claim document to Amazon S3. - AWS Lambda gets triggered when the file arrives. - Amazon Bedrock Data Automation extracts the information from the document and outputs the result in JSON format. - AWS Lambda then calls AgentCore and passes the document for processing. - AgentCore queries AWS HealthLake, creates the claim, and creates a summary JSON response. - AWS Lambda invokes Amazon SNS to deliver an error response or a success response. Lambda is used as an event trigger when a document is created in S3 and serves as a deterministic supervisor over the agentic workflow. It validates that each document is processed or sent to a dead letter queue for exception handling. Bedrock Data Automation streamlines generative AI development and automates workflows involving documents, images, audio, and videos. For document processing, Bedrock Data Automation combines traditional optical character recognition OCR , machine learning ML models, and generative AI to extract data accurately. You can use Blueprints artifacts to specify what data to extract from a document and how to extract it. You can use pre-built templates or build custom configurations tailored to your use cases. The output includes confidence scores and bounding box data for the extracted fields and tables. The custom output here produces a predictable JSON representation of the CMS-1500 claim form across its format variations. AgentCore hosts the Strands agent. The agent uses two tools to interact with HealthLake: create fhir claim and search fhir resources . The agent uses the following workflow: - Find the Insured, Patient, Practitioner, and Coverage information in AWS HealthLake to use as a reference in the claim form. The first attempt uses direct method calls https://strandsagents.com/docs/user-guide/concepts/tools/ direct-method-calls and default search parameters. Beyond that, the agent runs the following prompt to check the tool calls and re-attempt searches if necessary: Identify the insured resource, first by looking at prior tool calls. If there is no match, try two more attempts to find a match by using different search parameters from the claim JSON. Focus on high confidence score attributes and report how you found the match. - If the references are found, create a FHIR representation of the claim and send it to AWS HealthLake. - Create a JSON object that captures the work completed. The object includes the claim ID if one was created , a response for the human processor, and a response for the patient. The processor response acts as an alert or observation. The patient response signals back to the submitter when errors need to be corrected. Prerequisites Before you deploy the solution, make sure you have the following: - An AWS account with administrator permissions. - Access to Anthropic Claude Sonnet 4.6 on Amazon Bedrock. For more details, see Access Amazon Bedrock foundation models https://docs.aws.amazon.com/bedrock/latest/userguide/model-access.html . - NodeJS version 24 or later. - Node Package Manager npm version 11.5 or later. - Python version 3.13 or later. - AWS Cloud Development Kit AWS CDK version 2.1025 or later. Deploy the solution The AWS Cloud Development Kit CDK and the AgentCore command line interface are used for deployment with the following steps: - Clone the repository: - Run the following commands from the repository root: Subscribe to the SNS topic to receive notifications - Access the Amazon SNS console https://console.aws.amazon.com/sns/ . - Choose Topics . - Choose Agent-Notifications . - Choose Create subscription . - For protocol, choose email . - Enter your email address. - Choose Create subscription . - Follow the link in the confirmation notice in your email to confirm your subscription. Use the solution The following sections walk through two scenarios: a failure scenario and a success scenario. 1. Failure scenario: Simulate a failure by leaving out one of the required reference resources in AWS HealthLake. The project code includes a sampledata folder. Use load sampledata.py to stage some data, where