Stop Blocking Virtual Threads: Building Asynchronous Human-in-the-Loop AI Agents with Spring AI

wpnews.pro

cd /news/ai-agents/stop-blocking-virtual-threads-buildi… · home › topics › ai-agents › article

[ARTICLE · art-21252] src=dev.to ↗ pub=2026-06-04T07:08Z topic=ai-agents verified=true sentiment=↓ negative

Stop Blocking Virtual Threads: Building Asynchronous Human-in-the-Loop AI Agents with Spring AI

A developer has built an asynchronous human-in-the-loop AI agent system using Spring AI that avoids blocking virtual threads during approval workflows. The approach serializes the agent's ReAct loop state—including token history, tool call IDs, and pending variables—to a Redis-backed persistent store, terminates the active thread immediately, and hydrates a new agent instance when a human decision webhook fires. The implementation uses a custom `ChatMemory` adapter that supports snapshotting at specific message indices and a resume endpoint that injects the human's decision as a `ToolResponseMessage` to continue the ReAct loop.

read2 min views24 publishedJun 4, 2026

In 2026, letting autonomous AI agents execute high-risk enterprise tools without human oversight is a production liability, but blocking platform threads—or even Project Loom’s virtual threads—for hours waiting for a manager's Slack approval is absolute architectural malpractice. We must transition from synchronous execution loops to stateless, event-driven agent hydration where the LLM's reasoning state is serialized and persisted during human-in-the-loop (HITL) interrupts.

VirtualThreadExecutor

) solve the wait problem—they do not; holding resources open for a 4-hour human coffee break destroys system scalability and ruins connection pools.ChatMemory

or agent context) in local heap memory, making your system highly vulnerable to redeployments and node failures.CompletableFuture

or busy-waiting database polling loops to check if a human has clicked "Approve" on an external UI.The clean solution is to serialize the agent's execution state—the ReAct loop token history, tool call IDs, and pending variables—to a persistent store, terminate the active thread immediately, and hydrate a brand-new agent instance when the approval webhook fires.

AgentSuspensionException

containing the serialized stateId

and tool execution metadata when a high-risk tool is triggered.ChatClient

with a custom Redis-backed ChatMemory

implementation that supports snapshotting at specific message indices./api/v1/agent/resume

that accepts the human decision, merges it into the serialized history as a ToolResponseMessage

, and triggers the next step of the ReAct loop.

@PostMapping("/agent/resume")
public ResponseEntity<String> resumeAgent(@RequestBody ApprovalResponse approval) {
    // 1. Retrieve serialized chat history (ReAct state) from Redis
    List<Message> history = stateRepository.findById(approval.stateId());

    // 2. Inject the human's decision as if it were the tool's output
    String toolOutput = approval.approved() ? "Approved: " + approval.notes() : "Rejected by human";
    history.add(new ToolResponseMessage(approval.toolCallId(), toolOutput));

    // 3. Hydrate the agent and resume execution without blocking threads
    ChatResponse response = chatClient.prompt()
        .messages(history)
        .call()
        .chatResponse();

    return ResponseEntity.ok(response.getResult().getOutput().getContent());
}

ChatMemory

adapters to dynamically hydrate and dehydrate context windows on demand.

Heads up:if you want to see these patterns applied to real interview problems,[javalld.com]has full machine coding solutions with traces.

source & further reading

dev.to — original article Teaching Agents to Slow Down Where It Matters Introducing Radar: An Open-Source, Self-Hosted AI Media Intelligence Platform Cross-Vendor Audit: What It Caught in My Own Model's Writing, and What It Got Wrong

~/api · this article 200

$curl api.wpnews.pro/v1/news/stop-blocking-virtual-th…

Read original on dev.to → dev.to/machinecodingmaster/stop-blocking-virtual…

mentioned entities

Project Loom

Spring AI

Slack

Redis

ReAct

ChatMemory

ChatClient

VirtualThreadExecutor

metadata

slugstop-blocking-virtual-threads-building-asynchronous-human-in-the-loop-ai-agents

topic#ai-agents

secondary4 topics

sentimentnegative

canonicaldev.to

navigation

← prevHis AI Said 'Swap the PSU.' He S…

next →Nobody needs Mythos or 0-days to…

── more in #ai-agents 4 stories · sorted by recency

byteiota.com · 19 Jul · #ai-agents

Claude Fable 5 Developer Guide: API, Pricing, Refusals

shikigami.dev · 19 Jul · #ai-agents

Show HN: Shikigami, run AI coding agents in parallel, each in a Git worktree

dev.to · 19 Jul · #ai-agents

One Missed Test Case Cost Me 8 Hours — How I Built a Zero-Regression Memory Test Suite with Pytest + Docker

dev.to · 19 Jul · #ai-agents

Teaching Agents to Slow Down Where It Matters

── more on @project loom 3 stories trending now

wpnews · 26 May · #ai-agents

Think, Durable Objects, and the Real Shape of AI Applications

wpnews · 18 Jul · #artificial-intelligence

Ada: An AI business intelligence software from CSV and Excel(yes LLMs but more)

wpnews · 8 Jul · #ai-chips

D-Matrix launches Corsair AI inference platform, challenging Nvidia’s GPU dominance

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required