Unlimited OCR
Unlimited OCR is a 3-billion-parameter open-weight model from Baidu that transcribes multi-page documents in a single inference pass — eliminating the page-by-page chunking that makes traditional OCR pipelines brittle. The core innovation is Reference Sliding Window Attention (R-SWA), which holds KV cache at constant size regardless of output length.
Baidu released the model on June 22, 2026 under the MIT license alongside the arXiv paper "Unlimited OCR Works: Welcome the Era of One-shot Long-horizon Parsing." The model achieves 93.92% on OmniDocBench v1.6 — a 6-point gain over DeepSeek OCR — and processes 40+ page PDFs in a single forward pass within a 32K token context window.
A legal team processing 50-page contracts can run `baidu/Unlimited-OCR` locally via Ollama — the model ingests the full PDF image in one pass, extracts tables, formulas, and dense text with consistent layout awareness, and outputs structured Markdown. No page boundaries to stitch, no context loss mid-document.
Think of it as a photographic memory for scanners — one look at the whole stack, then write it all out.
Search Interest
-
Nascent ← now0–7 days
-
Emergent8–30 days
-
Validating31–90 days
-
Rising91–180 days
-
Established180 days +
Why is it emerging now?
Baidu released Unlimited OCR on June 22, 2026, solving the KV cache blowup that forced every long-document OCR pipeline to chunk by page. The MIT license and Ollama/vLLM compatibility mean teams can swap it in without a managed API, and the 93.92% OmniDocBench v1.6 score beats DeepSeek OCR by over 6 points.
Outlook
6-month signal projection and commercial timeline.
MIT license, Ollama/vLLM support, and constant-memory long-document parsing fill a real gap in enterprise document workflows.
Risk · No managed API — teams must self-host a GPU, limiting reach to infra-capable buyers.
Analogs · deepseek-ocr · mistral-ocr · surya
-
nowSelf-host pipeline tools
Wrap the model as a REST API for document teams replacing cloud OCR vendors.
-
3-6moManaged API + comparison content
First managed-API wrapper SaaS and SEO content ranking for 'unlimited ocr vs mistral ocr' queries.
-
6-12moEnterprise contract parsing
Legal, compliance, and finance verticals adopt long-document OCR pipelines at scale.
Competition & Opportunity for term “Unlimited OCR”
Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.
Ideas for term “Unlimited OCR”
Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.
High commercial intent — autocomplete shows 'unlimited ocr api' and 'unlimited ocr vs' queries already forming. Covers accuracy, cost, self-host vs API trade-offs.
Tutorial demand is immediate — 45k HF downloads in 48 hours signals dev adoption. Targets the long tail of 'unlimited ocr free' and 'unlimited ocr pdf' searches.
SEO gap: the technique name is novel and has zero explainer coverage yet. Captures ML engineer research traffic from 'R-SWA unlimited ocr' and 'constant KV cache OCR'.
Baidu ships no hosted API — large gap for a SaaS wrapper targeting teams that need OCR without GPU infra. Direct path to recurring revenue from document-heavy businesses.
Combines Unlimited OCR's long-doc accuracy with downstream vector indexing. Targets legal, compliance, and research teams moving away from AWS Textract or Azure Document Intelligence.
Category is maturing fast — a community benchmark tracker fills the gap left by fragmented individual blog posts and captures long-tail comparison queries.
Visual format suits the diff — show the actual output side-by-side. High shareability in developer and legal-tech communities.
Every other long-document OCR system chunks your PDF into pages, processes each independently, and stitches the output. Unlimited OCR does it in one pass — and the KV cache never grows.
unlimitedocr.com was registered the same day Baidu published the paper. By day two, .org and .xyz were gone. The model had 5k stars before most people had even read the abstract.
AWS Textract charges per page. Azure Document Intelligence charges per page. Unlimited OCR charges $0 — runs on your own GPU, MIT licensed, and matches their accuracy on the standard benchmark.
What People Search
Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.
SERP of term “Unlimited OCR”
What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.
FAQ
What is Unlimited OCR?
Unlimited OCR is a 3-billion-parameter open-weight model from Baidu that transcribes multi-page documents in a single inference pass — eliminating the page-by-page chunking that makes traditional OCR pipelines brittle.
Why is Unlimited OCR emerging now?
Baidu released Unlimited OCR on June 22, 2026, solving the KV cache blowup that forced every long-document OCR pipeline to chunk by page. The MIT license and Ollama/vLLM compatibility mean teams can swap it in without a managed API, and the 93.92% OmniDocBench v1.6 score beats DeepSeek OCR by over 6 points.
When did Unlimited OCR emerge?
Publicly emerged around 2026-06-22 (about 3 days ago as of 2026-06-25). EarlyTerms first recorded a pipeline signal on 2026-06-24.
Related Terms
Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.
- Part of ·
- Includes
- Competitor ·
- Related ···
Sources
Primary URLs this report cites — open any to verify the claim yourself.
- 01 baidu/Unlimited-OCR — official GitHub repository github.com ↗
- 02 Unlimited OCR Works — arXiv paper (Jun 22, 2026) arxiv.org ↗
- 03 baidu/Unlimited-OCR — Hugging Face model card huggingface.co ↗
- 04 Hacker News — Unlimited OCR: One-shot long-horizon parsing (478 pts) news.ycombinator.com ↗
- 05 AI Weekly — Baidu Releases MIT-Licensed 3B OCR Model for Long Documents aiweekly.co ↗
- 06 Data Science in Your Pocket — Baidu's Unlimited OCR: Beats DeepSeek OCR, Parses Entire Book in One Go medium.com ↗