Show HN: Polyvia – Multimodal document retrieval over 100K+ files

Polyvia released Polyvia 1, a multimodal document retrieval API and upcoming platform for enterprise agents, enabling sub-200ms search over 100K+ files including PDFs, charts, and slides. The API provides end-to-end retrieval without external extractors, targeting use cases like data room search and credit monitoring.

We build enterprise agents for large-scale retrieval, research and automation over multimodal docs.

Docs: https://docs.polyvia.ai · Quickstart: https://docs.polyvia.ai/quickstart · Python SDK: https://docs.polyvia.ai/products/python-sdk · TypeScript SDK: https://docs.polyvia.ai/products/js-sdk · Polyvia Platform: https://app.polyvia.ai · Homepage: https://polyvia.ai

We're releasing Polyvia 1 as two products:

- Polyvia API: Multimodal Document Retrieval API for developers of AI agents (available now).
- Polyvia Platform: Research and Automation Agent over 100K+ multimodal docs for knowledge workers in enterprises (coming soon).

We index your unstructured, visual, and multimodal docs (PDFs, charts, slides, complex tables, infographics, scans, handwriting, invoices, and more) into a multimodal knowledge ontology, with agents running on top for retrieval, research and automation. Every answer is grounded in a cited source page, in sub-200ms.

1. Fast over 100K+ multimodal docs. Agentic, file-by-file search (Claude Code, Claude Cowork, Codex) works only up to ~100 multimodal files; past that it's too slow, and at scale you still need retrieval. Polyvia does sub-200ms search over 100K+ files, with every answer grounded in a cited source page.

2. End-to-end: no need for extractors or PDF parsers. When you build large-scale multimodal RAG over a company's files, the only infra available today is visual extractors / PDF parsers (Reducto, LlamaIndex).
There's no end-to-end infra for large-scale multimodal document retrieval, until Polyvia: VLM Visual Extractor → Multimodal Knowledge Ontology (mapping all your company's data and processes) → Self-Improving Retrieval Agent.

3. All unstructured, visual and multimodal data inputs in one API.
- Available now: PDFs, charts, infographics, complex multi-page tables, slides, pictures, handwriting, scans, invoices, audio.
- Coming soon: video, healthcare scans / EHR, chemical & molecular data, CAD & technical drawings, heatmaps.

- Multimodal RAG inside your own agent: retrieval-as-a-tool over large doc sets.
- Data-room / due-diligence search: query 100+ visual-heavy PDFs jointly (PE, IB, M&A).
- Counterparty & credit monitoring: EBITDA, opex, revenue across hundreds of borrower reports.
- Image-based claim processing: describe claim photos in the context of a policy.
- Cross-engagement slide search: find answers buried in thousands of slides.

pip install polyvia   # Python 3.9+
npm install polyvia   # Node 18+

Grab a key in Polyvia Platform (https://app.polyvia.ai) → Settings → API. Ingest a batch into a group, then ask one question across the whole corpus; answers cite the exact page in each document.

python
from polyvia import Polyvia
client = Polyvia(api_key="poly