cd /news/large-language-models/law-professors-prefer-ai-over-peer-a… · home topics large-language-models article
[ARTICLE · art-21173] src=marginalrevolution.com pub= topic=large-language-models verified=true sentiment=↑ positive

Law professors prefer AI over peer answers

A blinded evaluation of short-answer tutoring in contracts courses with 16 U.S. law professors found that professors rated large language model (LLM) responses far higher than those written by their peers, with an average win rate of 75.33%. LLM answers were also flagged as harmful far less often (3.53%) than human answers (12.06%). The findings suggest AI tutors can outperform human instructors in judgment-rich domains like law, challenging the value of human expertise in academic evaluation.

read2 min publishedJun 4, 2026

Large language models (LLMs) are increasingly promoted as educational tutors, yet most evaluations focus on domains with a single ground truth. Many disciplines, however, hinge on judgment: reasoning, weighing ambiguity, and reaching defensible conclusions. Law provides a sharp test. We conducted a blinded evaluation of short-answer tutoring in contracts courses with sixteen U.S. law professors. Participants created 40 representative questions, wrote answers, and judged 2,918 anonymized comparisons between human and LLM responses. Professors rated LLMs far higher than their peers (average win rate = 75.33%), with models performing similarly to the best instructor. LLM responses were also rarely flagged as harmful (3.53%, vs 12.06% for professors). Preferences for LLM answers were consistent across evaluators and reflected shared professional standards. Our evaluation can be reliably extended to additional models by employing a separate LLM as a judge, rendering expert agreements an effective, scalable method to evaluate AI tutors in judgment-rich domains.

“far”. That is from a new paper by Alejandro Salinas, et.al. Via Andrew Curran. And via John Chamberlain:

Artificial intelligence (AI) and large language models (LLMs) tools are capable of mass-producing academic finance papers that are nearly indistinguishable from human-authored research, according to a new study published in the

[Journal of Economic Literature]. C’mon people, get ready. I know it is difficult to admit when your human capital has been devalued, but that time is upon us. In particular, being prolific is no longer such a comparative advantage in academia. You might run to the “but I know what questions to ask” cope, but I implore you to solve for the equilibrium. What is the equilibtium wage for merely asking questions?

Of course academic life and projects will continue, but the real rewards will go to people doing new, innovative, and hitherto impossible projects with AI.

── more in #large-language-models 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/law-professors-prefe…] indexed:0 read:2min 2026-06-04 ·