ProfileFoundry: A Synthetic Person-Object Substrate for Privacy, Memory, and Tool-Use Evaluation in LLM Agent

wpnews.pro

cd /news/large-language-models/profilefoundry-a-synthetic-person-ob… · home › topics › large-language-models › article

[ARTICLE · art-40243] src=arxiv.org ↗ pub=2026-06-26T04:00Z topic=large-language-models verified=true sentiment=· neutral

ProfileFoundry: A Synthetic Person-Object Substrate for Privacy, Memory, and Tool-Use Evaluation in LLM Agent

Researchers released ProfileFoundry, a synthetic dataset of 100,000 person objects with 709,228 events, 40,338 households, and 52,491 employers, designed for evaluating LLM agents on memory, privacy, and tool-use tasks. The deterministic generator ensures cross-field and temporal consistency without using real user data, enabling responsible redistribution and controlled evaluation.

read1 min views2 publishedJun 26, 2026

arXiv:2606.26403v1 Announce Type: new Abstract: Foundation-model research increasingly needs data about people: user state, personal histories, relationships, contact-like fields, documents, and longitudinal updates. Real user data is difficult to share, perturb, audit, or redistribute responsibly, while independently generated fake fields rarely preserve the cross-field and temporal consistency needed for controlled evaluation. We present PROFILEFOUNDRY, a deterministic generator and fixed reference release of 100,000 adult synthetic Person Objects across eight locales. Each object combines a typed current snapshot, household, family, and employer links, snapshot-aligned events, normalized relational views, and generation provenance. The release contains 709,228 events, 40,338 households, 52,491 employers, and 518,564 directed relationship edges. We report evidence in separate categories: selected population-marginal comparisons, per-object invariant checks, release-wide referential and temporal closure, and coincidence/provenance screens. PROFILEFOUNDRY is not a population-fidelity model, a rendered-text corpus, or a formal privacy mechanism. Instead, it is a responsible synthetic source layer for constructing downstream foundation-model evaluations involving memory, privacy, document understanding, record linkage, and agent state while keeping the synthetic person behind each artifact inspectable

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/profilefoundry-a-synthet…

Read original on arxiv.org → arxiv.org/abs/2606.26403

mentioned entities

ProfileFoundry

arXiv

metadata

slugprofilefoundry-a-synthetic-person-object-substrate-for-privacy-memory-and-tool

topic#large-language-models

secondary4 topics

sentimentneutral

canonicalarxiv.org

navigation

← prevHo progettato un'infrastruttura …

next →Inside the infrastructure behind…

── more in #large-language-models 4 stories · sorted by recency

arxiv.org · 26 Jun · #large-language-models

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

arxiv.org · 26 Jun · #large-language-models

AlgoEvolve: LLM-driven Meta-evolution of Algorithmic Trading Programs

arxiv.org · 26 Jun · #large-language-models

Know2Guess: A Contamination-Aware Multi-Zone Benchmark for Knowledge-Boundary Evaluation in Large Language Models

arxiv.org · 26 Jun · #large-language-models

Reducing Conversational Escalation in Large Language Model Dialogue with Nonviolent Communication Constraints

── more on @profilefoundry 3 stories trending now

wpnews · 19 Oct · #developer-tools

Windows Script to clean up and remove all ASUS software

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Nov · #developer-tools

Custom Zig Test Runner, better ouput, timing display, and support for special "tests:beforeAll" and "tests:afterAll" tests

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required