Vectors Are Not Neutral: Sensitive-Information Inference from Exported LLM Representations in Summarization

wpnews.pro

cd /news/large-language-models/vectors-are-not-neutral-sensitive-in… · home › topics › large-language-models › article

[ARTICLE · art-14925] src=arxiv.org ↗ pub=2026-05-27T04:00Z topic=large-language-models verified=true sentiment=· neutral

Vectors Are Not Neutral: Sensitive-Information Inference from Exported LLM Representations in Summarization

A new study finds that vector representations exported from large language model summarization systems can leak sensitive information, even when source documents remain access-restricted. Researchers auditing clinical discharge summaries from electronic health records demonstrated that race information could be inferred from exported vectors, and that reducing recoverability from one artifact did not guarantee protection from another. The findings underscore that privacy auditing must target the exact vector artifact exposed to downstream components, as mitigations like the proposed SurfaceLoRA method may fail to protect untargeted representations.

read1 min views9 publishedMay 27, 2026

arXiv:2605.26433v1 Announce Type: new Abstract: Large language model (LLM) summarization systems may pass compact vector representations of private inputs to downstream retrieval, monitoring, audit, or analytic workflows. Even when source documents remain access-restricted, derived vectors may be handled under different access controls and still support sensitive-information inference, creating a residual information-disclosure risk. We study this issue in clinical discharge-summary generation as a high-stakes case study, using electronic health record (EHR)-recorded race as a controlled sensitive-label audit. We audit two artifacts that a system might retain or expose to downstream components: the final prompt-token hidden state and the mean-pooled prompt representation. Our results show that reducing recoverability of the case-study sensitive label from one exported artifact does not necessarily reduce recoverability from another. As a mitigation case study, we introduce SurfaceLoRA, an exported-vector-targeted parameter-efficient fine-tuning method that uses a gradient-reversal discriminator attached to a designated exported vector. Under a balanced five-way probing protocol, SurfaceLoRA reduces EHR-recorded race recoverability from the targeted final-token artifact toward chance while preserving summarization utility, yet recoverability remains substantially higher from untargeted pooled artifacts. These findings show that privacy auditing and mitigation should be performed on the exact vector artifact retained or exposed to downstream components.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/vectors-are-not-neutral-…

Read original on arxiv.org → arxiv.org/abs/2605.26433

mentioned entities

SurfaceLoRA

EHR

metadata

slugvectors-are-not-neutral-sensitive-information-inference-from-exported-llm-in

topic#large-language-models

secondary4 topics

sentimentneutral

canonicalarxiv.org

navigation

← prevSejong University launches Asia’…

next →European AI adoption hits 99% wi…

── more in #large-language-models 4 stories · sorted by recency

sourcefeed.dev · 12 Jul · #large-language-models

Causality Turns LLM Internals Into Debuggable Circuits

blog.disclose.io · 12 Jul · #large-language-models

Policy Pulse - Issue #24 | Week of July 11, 2026

dev.to · 12 Jul · #large-language-models

$440 Million in 45 Minutes: When a Company's Own Automated System Loses the Company's Own Money

machinebrief.com · 12 Jul · #large-language-models

FLARE-AI: Shining a Light on AI's Dark Side

── more on @surfacelora 3 stories trending now

wpnews · 23 May · #artificial-intelligence

AccessLens — a blind person's lanyard, powered by Gemma 4 on-device

wpnews · 21 May · #developer-tools

Antigravity CLI: A Hands-On Guide to Google's Terminal Coding Agent

wpnews · 8 Jul · #artificial-intelligence

SpaceXAI unveils Grok 4.5 AI model ahead of July 2026 public release

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required