cd /news/large-language-models/evaluating-second-order-bias-of-llms… · home topics large-language-models article
[ARTICLE · art-30535] src=arxiv.org ↗ pub= topic=large-language-models verified=true sentiment=· neutral

Evaluating Second-Order Bias of LLMs Through Epistemic Entitlement

Researchers at the University of Toronto introduced a novel method to evaluate second-order social bias in large language models (LLMs), where models exhibit bias in their judgments about biased content. Using a reasoning task based on epistemic entitlement, they found that LLMs infer demographics without sufficient support and vary judgments across targeted groups, evading safety guardrails. The study highlights the need for theoretically grounded bias evaluation in LLM judgment tasks.

read1 min views1 publishedJun 17, 2026

arXiv:2606.17506v1 Announce Type: new Abstract: Evaluations of social bias in LLMs largely focus on whether models generate or imply biased content. However, as LLMs are increasingly used as judges of bias, they may exhibit social biases in subtler ways in how they evaluate biased content, which current methods do not systematically capture. We call this second-order bias: social bias in an LLM's judgment about social bias, which we evaluate through a novel, philosophically grounded reasoning task. Drawing on entitlement epistemology, we conceptualize bias as misplaced foundational knowledge that shapes an agent's rational inquiry, and derive a logical reasoning task for LLMs to judge to whom a biased text is acceptable or non-acceptable. We develop two simple metrics to measure how biased LLM judges are in inferring demographics for acceptability without sufficient support, and how these inferences vary across groups targeted by biased texts. Evaluating open and closed models, we find that our task evades safety guardrails by surfacing bias in model judgment. It varies systematically across target groups, reflects implicit social maps, and shows how models are still triggered by demographic labels. Our work points to the need for LLM bias evaluation in judgment tasks and broadly, for more theoretically grounded approaches to bias evaluation in NLP. We release our code and model responses at https://github.com/uofthcdslab/second-order-bias.

── more in #large-language-models 4 stories · sorted by recency
── more on @university of toronto 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/evaluating-second-or…] indexed:0 read:1min 2026-06-17 ·