Unlimited OCR: One-shot long-horizon parsing Baidu released Unlimited-OCR, a one-shot long-horizon parsing model that extends DeepSeek-OCR, on June 22, 2026. The open-source model supports single-image and multi-page PDF parsing with configurable image sizes and is available on GitHub and ModelScope. - 2026/06/23 📄 Our paper is now available on arXiv https://arxiv.org/abs/2606.23050 . - 2026/06/23 🤝 Thanks to the ModelScope community for their support. Our model is now available at ModelScope https://modelscope.cn/models/PaddlePaddle/Unlimited-OCR . - 2026/06/22 🚀 We present Unlimited-OCR https://github.com/baidu/Unlimited-OCR , aiming to push Deepseek-OCR https://github.com/deepseek-ai/DeepSeek-OCR one step further. Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.12.3 + CUDA12.9: torch==2.10.0 torchvision==0.25.0 transformers==4.57.1 Pillow==12.1.1 matplotlib==3.10.8 einops==0.8.2 addict==2.4.0 easydict==1.13 pymupdf==1.27.2.2 psutil==7.2.2 python import os import torch from transformers import AutoModel, AutoTokenizer model name = 'baidu/Unlimited-OCR' tokenizer = AutoTokenizer.from pretrained model name, trust remote code=True model = AutoModel.from pretrained model name, trust remote code=True, use safetensors=True, torch dtype=torch.bfloat16, model = model.eval .cuda ── Single image supports two configs: gundam or base ── gundam: base size=1024, image size=640, crop mode=True base: base size=1024, image size=1024, crop mode=False model.infer tokenizer, prompt='