The Legal Agent Stack

wpnews.pro

cd /news/large-language-models/the-legal-agent-stack · home › topics › large-language-models › article

[ARTICLE · art-29411] src=lexifina.com ↗ pub=2026-06-16T12:06Z topic=large-language-models verified=true sentiment=· neutral

The Legal Agent Stack

Alan Yahya outlines the evolution of LLM agents from fine-tuning to optimizing agent workflows, emphasizing the need for features like session management and in-line editing. He notes that vertical applications differentiate through memory management and workspace design, citing Cursor and Claude Code as examples. Consolidation trends in legal tech, such as Filevine acquiring Pincites, reflect a shift toward integrated agent workspaces.

read3 min views24 publishedJun 16, 2026

Alan Yahya3 min read

Background #

A lot of LLM use revolves around agents. Unlike a simple chat completion, an agent will analyse your message, create a plan, and apply tools to achieve a task. LLMs are stateless, they only see an input and output, so an agent needs to be able to manage context and delegate tasks.

To improve these systems, we used to directly edit the LLM weights. This was a trade-off, as many of the emergent properties of an LLM come from their generality, which model fine tuning would inevitably degrade.

Overall, fine tuning would work well for problems that could be solved in one shot. However, most real-world tasks are solved by an agent in multiple steps. As a result, most approaches now focus on optimising the agent, rather than altering the LLM directly.

Working with agents #

As agent workflows mature, users will come to expect a common set of features, which make LLMs easier to work with. For agent-heavy workflows, these features quickly become essential.

This includes managing how knowledge is retained, including the ability to fork, rewind or export agent sessions, which can span hundreds of thousands of lines of text. It also includes managing how changes are applied, such as the ability to review suggestions in-line, alter suggestions using different context, or merge multiple changes logically.

Vertical applications #

Once a core agent workflow is established, each vertical can begin to differentiate itself. Firstly, in terms of how the persistent agent memory is managed. The information you capture, and the hierarchy of information, will depend on the field you are working in. For example in coding you might persist top-level functions. In law, you might persist common entities between sections and clauses.

Secondly, in terms of the design of the workspace itself. One example is Cursor, which was initially a plugin, and now provides a standalone desktop application. Another example is the terminal interface, like Claude Code, which is a very high level of abstraction. Users with decades of familiarity using a particular app (like MS Word) will naturally prefer the latter. Particularly in a document where you need to audit every sentence, most users are more comfortable doing so in Word over a terminal.

Still, it leaves us in an interesting situation. We have multi-agent multi-document applications within a tool designed for single document editing (MS Word). Most AI providers provide a web application, where they have more control over how the application works, but moving users off tools like Word will be a gradual process.

Consolidation #

It is increasingly common to glue multiple apps together using a MCP or similar connector. This works, but can be a clumsy solution, as custom connectors between systems not built for interoperability can surface problems and edge cases.

There are obvious advantages to software consolidation, that are compounded by the high volume nature of LLMs. In the legal technology space, we can see this with infrastructure tools acquiring document editors (IE Filevine acquiring Pincites or Relativity acquiring Gavel).

This fits into an overarching picture, one of software companies locked in an existential struggle. These companies are no longer content to own a single slice of the stack, and as a result we see many of them building out their own complete agent workspace.

source & further reading

lexifina.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/the-legal-agent-stack

Read original on lexifina.com → lexifina.com/blog/the-legal-agent-stack

mentioned entities

Alan Yahya

Cursor

Claude Code

Filevine

Pincites

Relativity

Gavel

MS Word

metadata

slugthe-legal-agent-stack

topic#large-language-models

secondary4 topics

sentimentneutral

canonicallexifina.com

navigation

← prevCo-Op and Next among firms launc…

next →Hong Kong Science Park, SenseTim…

── more in #large-language-models 4 stories · sorted by recency

runtimewire.com · 31 Jul · #large-language-models

Supabase launches open benchmark for AI coding agents building backends

promptcube3.com · 31 Jul · #large-language-models

Claude Skills, Benchmarked: How I Use Them Without the Hype

github.com · 31 Jul · #large-language-models

Sessiongrep: Local-first memory layer for CLI agents

alanyahya.com · 15 Jun · #large-language-models

"Cursor for X": key standards for vertical products offering agent workflows

── more on @alan yahya 3 stories trending now

wpnews · 30 Jul · #artificial-intelligence

Microsoft and Meta Earnings Show Different AI Spending Pressures

wpnews · 31 Jul · #ai-products

E J Ziyad launches UML, a shared memory graph for Claude and ChatGPT

wpnews · 31 Jul · #artificial-intelligence

OpenAI Slashes GPT-5.6 Prices as Tech Giants Wage War Over Enterprise AI Spending

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required