LLMs Contain Multitudes: How Deployment Context Reshapes Model-Level Preferences and Values

A new study finds that large language models' preferences and values shift significantly depending on the deployment context, such as whether the model is writing a Reddit post or a news article. Across five LLMs and over 1.2 million decisions, context-induced variation far exceeded that from prompt paraphrasing or temperature changes, with country preference rankings and utility judgments varying systematically. The findings challenge the notion of stable model-level values, suggesting safety guarantees under one framing may not transfer to others.

Published Jun 15, 2026

arXiv:2606.13944v1. Abstract: Large language models (LLMs) are increasingly characterised in recent evaluation work as having stable, model-level preference and value systems. However, accompanying robustness checks are limited to incidental prompt perturbations such as syntax variation and option reordering. This leaves open whether the measured properties survive when the surrounding task context changes, as it does in most real deployments. We test this directly across two established pairwise paradigms: ranking country preferences and eliciting utility judgements. In both, we make the deployment context (the high-level task the model is performing while making concrete value-dependent choices) our controlled variable, varied across framings such as writing a Reddit post or a news article. Across five LLMs and over 1.2M pairwise decisions, deployment context produces variation far larger than prompt paraphrasing and temperature controls. In country preference rankings over 15 countries, context induces widespread, statistically significant rank shifts; the aggregate Global North favouritism reported in prior work is itself context-dependent, with each model's bias shifting systematically across contexts. In utility elicitation over 50 outcomes, broad cross-category ordering is preserved, but fine-grained rankings within domains vary substantially, and cardinal exchange rates between outcomes (e.g. how many lives in one region equal one in another) shift by a factor of 2.47 at the median. Reported model-level preferences and utilities are therefore better understood as context-conditioned measurements than fixed model-level properties: safety guarantees obtained under one framing provide limited assurance in another.
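To make the two measurements in the abstract concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' pipeline: the abstract does not say how rankings or exchange rates are estimated, so this sketch swaps in a plain win-rate ranking and a symmetric fold change. The function names, the placeholder items `A`, `B`, `C`, the context labels, and all numbers are hypothetical toy values.

```python
from collections import defaultdict
from statistics import median


def rank_by_win_rate(decisions):
    """Rank items from (winner, loser) pairs; rank 1 is the most preferred.

    Assumption: a simple win-rate ordering stands in for whatever
    ranking model the study actually fits.
    """
    wins = defaultdict(int)
    games = defaultdict(int)
    for winner, loser in decisions:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    # Sort by win rate (descending); break ties by name for determinism.
    ordered = sorted(games, key=lambda item: (-wins[item] / games[item], item))
    return {item: pos for pos, item in enumerate(ordered, start=1)}


def rank_shifts(decisions_by_context):
    """For each item, the spread between its best and worst rank across contexts."""
    ranks = {ctx: rank_by_win_rate(d) for ctx, d in decisions_by_context.items()}
    shared = set.intersection(*(set(r) for r in ranks.values()))
    return {
        item: max(r[item] for r in ranks.values()) - min(r[item] for r in ranks.values())
        for item in shared
    }


def median_fold_change(rates_a, rates_b):
    """Median of max(a/b, b/a) over exchange rates measured in two contexts.

    Assumption: a symmetric fold change is one plausible reading of
    "shift by a factor of X at the median".
    """
    shared = rates_a.keys() & rates_b.keys()
    folds = [max(rates_a[k] / rates_b[k], rates_b[k] / rates_a[k]) for k in shared]
    return median(folds)


# Toy data: the same three placeholder items judged under two framings.
toy_decisions = {
    "reddit_post": [("A", "B"), ("A", "C"), ("B", "C")],
    "news_article": [("C", "A"), ("C", "B"), ("B", "A")],
}
shifts = rank_shifts(toy_decisions)

# Toy exchange rates for three hypothetical outcome pairs in two contexts.
fold = median_fold_change(
    {"x": 2.0, "y": 1.0, "z": 4.0},
    {"x": 1.0, "y": 3.0, "z": 4.0},
)
print(shifts, fold)
```

In the toy data, `A` and `C` swap ends of the ranking between the two framings while `B` stays in the middle. This is the kind of context-induced rank shift the abstract reports, here on made-up inputs.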

source & further reading

arxiv.org — original article
