07:09
2026-05-26
arxiv.org
machine-learning
Contrastive Decoding Diffing: Recovering Finetuning Data Without Weight Access
Researchers have developed Contrastive Decoding Diffing (CDD), a method that recovers verbatim facts from finetuned language models using only output-level logit distributions, without requiring accesβ¦