Using AI Agents to Automate Black-Box Audits of Personalization Algorithms at Scale

wpnews.pro

cd /news/generative-ai/using-ai-agents-to-automate-black-bo… · home › topics › generative-ai › article

[ARTICLE · art-45910] src=arxiv.org ↗ pub=2026-07-01T04:00Z topic=generative-ai verified=true sentiment=· neutral

Using AI Agents to Automate Black-Box Audits of Personalization Algorithms at Scale

Researchers introduced a framework using generative AI agents to automate black-box audits of personalization algorithms, deploying 1,120 agents on X after the 2024 U.S. election. They found that X's algorithmic feed amplifies toxic, polarizing, political, and right-leaning content compared to the chronological feed, with effects varying by user ideology and demographic signals influencing content delivery in persona-dependent ways.

read1 min views1 publishedJul 1, 2026

arXiv:2606.30801v1 Announce Type: new Abstract: Personalization algorithms determine what content users encounter on online platforms. Auditing these systems is difficult because independent auditors have only black-box access to the algorithms, while personalization depends on users' attributes, behavior, and evolving interaction histories. Existing auditing methods face a tradeoff: studies with real users capture realistic behavior but are costly and hard to control, whereas sock-puppet audits scale more easily but often rely on scripted behavior that limits realism. Beyond this, both approaches struggle to decouple user attributes from user behavior, limiting our ability to causally understand personalization. To address this gap, we introduce a framework for black-box audits of personalization algorithms using generative AI agents as behavioral engines for synthetic accounts. Each agent is instantiated with a fixed persona, grounded in demographic and political survey data, and interacts with a platform's content by reasoning about it and choosing actions. Because behavior is fixed within each persona while platform-visible signals such as age, gender, or location can be experimentally perturbed, our design enables counterfactual auditing of how platforms respond to user attributes. As a case study, we deploy 1,120 agents on X shortly after the 2024 U.S. election, spanning 14 personas and three counterfactual conditions, collecting over 200,000 content exposures. We find that X's algorithmic feed amplifies toxic, polarizing, political, and right-leaning content relative to the chronological feed, with amplification varying sharply by user ideology. Counterfactual analyses show that demographic signals affect content delivery in persona-dependent ways: pooled effects are largely null, while subgroup-level effects vary in direction and magnitude. Our work establishes GenAI-based agents as a new tool for algorithmic auditing.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/using-ai-agents-to-autom…

Read original on arxiv.org → arxiv.org/abs/2606.30801

mentioned entities

arXiv

metadata

slugusing-ai-agents-to-automate-black-box-audits-of-personalization-algorithms-at

topic#generative-ai

secondary4 topics

sentimentneutral

canonicalarxiv.org

navigation

← prevI Built 5 Free AI Tools That Rep…

next →Sivers emission övertecknades "f…

── more in #generative-ai 4 stories · sorted by recency

arxiv.org · 1 Jul · #generative-ai

Training Therapeutic Judges and Multi-Agent Systems for Human-Aligned Mental Health Support

arxiv.org · 1 Jul · #generative-ai

A Single Rewrite Suffices: Empirical Lessons from Production Skill Description Optimization

arxiv.org · 1 Jul · #generative-ai

When transformers learn "impossible" languages, what do they learn?

arxiv.org · 1 Jul · #generative-ai

Beyond Clean Text: Evaluating Encoder and Decoder Robustness for Bangla Event Detection in Noisy Text

── more on @x 3 stories trending now

wpnews · 30 May · #ai-tools

I was wasting 10 minutes every Claude session. So I built a fix.

wpnews · 27 May · #machine-learning

hunting for headroom on modded-nanoGPT (WR #82)

wpnews · 2 Jun · #ai-products

Microsoft launches Discovery platform for scientific R&D with Ginkgo Bioworks partnership

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required