AI Firms Offer to Fund Library Digitization AI companies are offering to fund library digitization in exchange for training rights, prompting the University of Virginia to publish a 'Statement of Shared Practice' with six commitments to separate digitization from AI use and ensure transparency. At least sixteen institutions have signed on, establishing a governance framework for evaluating such requests. AI Firms Offer to Fund Library Digitization Libraries are being approached by AI companies offering to pay for digitization of archival materials in exchange for AI training rights, according to a Daily News Item on pw.org June 15 . Per pw.org, the University of Virginia published a "Statement of Shared Practice" that has drawn signatories from at least sixteen institutions, establishing a shared governance framework for evaluating such requests. Coordinated by UVA dean of libraries and university librarian Leo S. Lo, the statement's six commitments include separating digitization from AI use, requiring provenance and transparency, and applying heightened scrutiny to broad training requests, per the UVA Library announcement April 3, 2026 . The pw.org piece quotes Lo: "We have responsibilities to the people who donated these materials to us." Endorsement is open to any institution that stewards unique cultural collections, including libraries, archives, and museums. What happened Per a Daily News Item on pw.org June 15 , libraries have been approached by AI companies offering to fund digitization of archival materials so the resulting data can be used to train models. The pw.org article reports that the University of Virginia published a "Statement of Shared Practice on AI and Archives," and that institutions including Duke University, Florida State, Northwestern, Oklahoma State, Rice, Tulane, University of Florida, University of Rochester, Wisconsin-Madison, Washington University in St. Louis, and Wayne State University joined as founding cosignatories when the statement launched on April 3, 2026, per the UVA Library announcement. Pw.org reports additional institutions have since signed on. The six shared commitments Per the UVA Library announcement, participating institutions agree to: - •use shared definitions to classify AI training requests - •evaluate digitization funding and AI training rights as distinct agreements - •require provenance and transparency -- determining whether source materials can be traced and removed - •prefer retrieval-based approaches where collections remain under institutional control - •apply a presumption against broad commercial training where meaningful provenance is unrealistic - •share information through a confidential ledger of request types and general terms across signatories Context and significance Lo is quoted in the pw.org piece: "We have responsibilities to the people who donated these materials to us." The UVA Library announcement elaborates: "Libraries hold materials that exist nowhere else. When a company asks to use those materials to train an AI system, we owe it to the people who entrusted them to us to ask hard questions before saying yes." The statement was developed following an Association of Research Libraries peer session attended by representatives from more than 42 institutions, per the UVA Library. Endorsement is open to any institution that stewards unique cultural collections -- libraries, archives, and museums -- not limited to universities. Technical context for practitioners For practitioners building or auditing training datasets, the shared commitments affect provenance tracking, license eligibility, and the scope of permissible downstream uses. Contracts that conflate digitization payment with unlimited model training can create legal and reproducibility complications for downstream model releases. The statement's emphasis on retrieval-based approaches where source materials remain under institutional control versus absorption-based training reflects a growing practitioner distinction in how archival corpora are accessed and attributed. What to watch Monitor whether additional institutions cosign; the precise contract clauses when libraries accept paid digitization rights retained, redaction processes, embargo periods ; and whether any public policy or legal challenges clarify reuse rights for digitized archival materials. The statement's 12-month commitment window runs through April 2027. Scoring Rationale The story is relevant to AI/ML practitioners involved in training data sourcing and governance: library archival corpora are common inputs to large model training, and the Statement of Shared Practice establishes a new sector-wide governance floor with 12+ founding institutions. Solid niche coverage with clear practitioner implications, but the event is a policy framework launch -- not a frontier model release, major funding event, or landmark regulation. Practice interview problems based on real data 1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with. Try 250 free problems /problems