Topics as Proxies for Sociodemographics: How Conversational Context Affects LLM Answers

wpnews.pro

cd /news/large-language-models/topics-as-proxies-for-sociodemograph… · home › topics › large-language-models › article

[ARTICLE · art-19927] src=arxiv.org ↗ pub=2026-06-03T04:00Z topic=large-language-models verified=true sentiment=· neutral

Topics as Proxies for Sociodemographics: How Conversational Context Affects LLM Answers

A new study finds that large language models struggle to infer user sociodemographics from a single conversation history, but that conversation topics serve as proxies for these groups and drive disparities in LLM-generated advice. Researchers determined that topics, rather than user demographics, are the primary predictor of outcome differences in high-stakes scenarios like legal, medical, and financial advice. The findings highlight a need for further research into how conversational context affects LLM outputs, as topics can influence advice in unpredictable ways.

read1 min views17 publishedJun 3, 2026

arXiv:2606.02776v1 Announce Type: new Abstract: When large language models (LLMs) are used in high-stakes scenarios, such as legal, medical and financial advice, even a single conversation history is enough to drive differences in outcomes between users. Prior work has demonstrated that this results in outcome disparities between sociodemographic groups, with some groups receiving more advantageous outcomes than others. In this work, we demonstrate that LLMs actually struggle to infer user sociodemographics from a single conversation history and that although there are disparities between sociodemographic groups, they are minimal in magnitude. To investigate what the main driver of these disparities is, we compare user sociodemographics to a range of (psycho)linguistic features of conversations, including conversation topic, emotions, and readability. We find that conversation topics are most predictive of LLM-generated advice within a conversational context, which, to some extent, function as proxies for sociodemographic groups and often affect advice in unpredictable ways. This is cause for concern and highlights the need for future research to better understand and, if needed, mitigate the effect of conversational context on LLM outputs in high-stakes scenarios.

source & further reading

arxiv.org — original article

── more in #large-language-models 4 stories · sorted by recency

dev.to · 18 Jul · #large-language-models

When Gatekeepers Panic: The Encyclopédie, Open AI Models, and the Politics of Accessible Knowledge

dev.to · 18 Jul · #large-language-models

AI in the QMS — what it actually does vs the marketing blur

dev.to · 18 Jul · #large-language-models

5 Things I Learned Doing AI Evaluation for 2 Years

dev.to · 18 Jul · #large-language-models

OpenAI ships GPT-Red to automate prompt-injection testing against AI agents

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required