Mistral OCR 4
Mistral's OCR model that turns documents into structured data — bounding boxes, block classification, and confidence scores across 170 languages.
Updated 2026-07-01
Overview
Mistral OCR 4 is a document-understanding model that converts PDFs, scans, and images into structured, machine-readable output — not just a wall of extracted text, but text tied to its position on the page. Each element comes back with a bounding box, a block classification (heading, paragraph, table, figure, and so on), and a per-element confidence score, across 170 languages. That structure is what separates it from plain text OCR: downstream systems can reason about layout, pull specific fields, and flag low-confidence regions for review instead of guessing.
Released June 23, 2026, it sits at the front of what Mistral is positioning as an enterprise document pipeline. You can hit the raw OCR model through the API at $4 per 1,000 pages (dropping to roughly $2 per 1,000 with the batch discount), or use the higher-level Document AI product at $5 per 1,000 pages when you want the extraction-to-fields workflow handled for you. The pricing is aggressive for what is a genuinely structured output, and the batch tier makes large back-catalog digitization projects realistic to budget.
It's built for teams processing documents at volume — invoice and receipt extraction, contract and form parsing, insurance and legal intake, and feeding clean, positioned text into RAG systems where layout context actually matters. It's an API-first, developer-facing product, so it competes less with consumer scan apps and more with Google Document AI, AWS Textract, and Azure Document Intelligence.
Key features
Positional extraction
Returns bounding boxes for every detected element, so extracted text stays tied to where it sits on the page — essential for tables, forms, and multi-column layouts that flat OCR mangles.
Block classification
Labels each region by type (heading, paragraph, table, figure, etc.), giving downstream code semantic structure to parse against instead of an undifferentiated text blob.
Confidence scoring
Every element carries a confidence score, letting pipelines auto-accept high-certainty regions and route low-confidence ones to human review rather than silently propagating errors.
Broad language coverage
Handles 170 languages, covering multilingual document sets and non-Latin scripts that many OCR engines struggle with.
Pricing
| Plan | Price | What's included |
|---|---|---|
| OCR API | $4 per 1,000 pages | Raw structured OCR — bounding boxes, block classification, confidence scores, 170 languages. Pay-as-you-go via the Mistral API. |
| OCR API (Batch) | ~$2 per 1,000 pages | Batch-processing discount (roughly 50% off) for large, non-latency-sensitive jobs like archive digitization. |
| Document AI | $5 per 1,000 pages | Higher-level document product layering extraction-to-fields workflows on top of the OCR model. |
Raw structured OCR — bounding boxes, block classification, confidence scores, 170 languages. Pay-as-you-go via the Mistral API.
Batch-processing discount (roughly 50% off) for large, non-latency-sensitive jobs like archive digitization.
Higher-level document product layering extraction-to-fields workflows on top of the OCR model.
Pros & cons
Pros
- ✓Structured output — bounding boxes, block types, and confidence scores, not just flat text
- ✓Aggressive per-page pricing with a batch tier that roughly halves cost for bulk jobs
- ✓170-language coverage handles multilingual and non-Latin document sets
- ✓Confidence scores enable clean auto-accept vs. human-review routing in pipelines
- ✓API-first design fits directly into RAG and enterprise document workflows
Cons
- ×No free tier — you pay from the first page, unlike some rivals' monthly free quotas
- ×Developer/API product with no polished consumer scan-app front end
- ×Enters a crowded field against entrenched Google Document AI, AWS Textract, and Azure Document Intelligence
- ×Real-world accuracy on messy handwriting and low-quality scans isn't yet independently benchmarked
How it compares
| Tool | Best for | Pricing | Score |
|---|---|---|---|
| Mistral OCR 4 | — | API $4 / 1,000 pages ($2 batch); Document AI $5 / 1,000 pages | 8.2/10 |
| Notion AI vs Notion AI → | — | From $10/mo | 9/10 |
| Zapier | — | Freemium | 8.9/10 |
| Fathom vs Fathom → | — | Freemium — Free core; Team $19/user/mo | 8.8/10 |
Compare head-to-head
Related reading
Claude Fable 5 Returns as Export Controls Lift
Anthropic restores global access to Claude Fable 5 and Mythos 5 on July 1 after the US lifted an 18-day export block. Here's what changed.
Cursor for iOS: AI Coding Agents Now Mobile
Cursor launched a native iPhone and iPad app on June 29, 2026 to start and steer cloud coding agents from anywhere. Here's what it does.
Runway Partners With MIXI for AI Gaming & More
Runway's strategic partnership with Japan's MIXI brings generative video and world models to gaming and entertainment. The scope and the read.
Ready to try Mistral OCR 4?
Head to the official site to start with Mistral OCR 4 — pricing and plans are listed above.
Visit Mistral OCR 4

