Productivity API $4 / 1,000 pages ($2 batch); Document AI $5 / 1,000 pages

Mistral OCR 4

Mistral's OCR model that turns documents into structured data — bounding boxes, block classification, and confidence scores across 170 languages.

Updated 2026-07-01

8.2

AI Score / 10

Visit Mistral OCR 4

Overview

Mistral OCR 4 is a document-understanding model that converts PDFs, scans, and images into structured, machine-readable output — not just a wall of extracted text, but text tied to its position on the page. Each element comes back with a bounding box, a block classification (heading, paragraph, table, figure, and so on), and a per-element confidence score, across 170 languages. That structure is what separates it from plain text OCR: downstream systems can reason about layout, pull specific fields, and flag low-confidence regions for review instead of guessing.

Released June 23, 2026, it sits at the front of what Mistral is positioning as an enterprise document pipeline. You can hit the raw OCR model through the API at $4 per 1,000 pages (dropping to roughly $2 per 1,000 with the batch discount), or use the higher-level Document AI product at $5 per 1,000 pages when you want the extraction-to-fields workflow handled for you. The pricing is aggressive for what is a genuinely structured output, and the batch tier makes large back-catalog digitization projects realistic to budget.

It's built for teams processing documents at volume — invoice and receipt extraction, contract and form parsing, insurance and legal intake, and feeding clean, positioned text into RAG systems where layout context actually matters. It's an API-first, developer-facing product, so it competes less with consumer scan apps and more with Google Document AI, AWS Textract, and Azure Document Intelligence.

Key features

Positional extraction

Returns bounding boxes for every detected element, so extracted text stays tied to where it sits on the page — essential for tables, forms, and multi-column layouts that flat OCR mangles.

Block classification

Labels each region by type (heading, paragraph, table, figure, etc.), giving downstream code semantic structure to parse against instead of an undifferentiated text blob.

Confidence scoring

Every element carries a confidence score, letting pipelines auto-accept high-certainty regions and route low-confidence ones to human review rather than silently propagating errors.

Broad language coverage

Handles 170 languages, covering multilingual document sets and non-Latin scripts that many OCR engines struggle with.

Pricing

Plan	Price	What's included
OCR API	$4 per 1,000 pages	Raw structured OCR — bounding boxes, block classification, confidence scores, 170 languages. Pay-as-you-go via the Mistral API.
OCR API (Batch)	~$2 per 1,000 pages	Batch-processing discount (roughly 50% off) for large, non-latency-sensitive jobs like archive digitization.
Document AI	$5 per 1,000 pages	Higher-level document product layering extraction-to-fields workflows on top of the OCR model.

OCR API $4 per 1,000 pages

Raw structured OCR — bounding boxes, block classification, confidence scores, 170 languages. Pay-as-you-go via the Mistral API.

OCR API (Batch) ~$2 per 1,000 pages

Batch-processing discount (roughly 50% off) for large, non-latency-sensitive jobs like archive digitization.

Document AI $5 per 1,000 pages

Higher-level document product layering extraction-to-fields workflows on top of the OCR model.

Pros & cons

Pros

✓Structured output — bounding boxes, block types, and confidence scores, not just flat text
✓Aggressive per-page pricing with a batch tier that roughly halves cost for bulk jobs
✓170-language coverage handles multilingual and non-Latin document sets
✓Confidence scores enable clean auto-accept vs. human-review routing in pipelines
✓API-first design fits directly into RAG and enterprise document workflows

Cons

×No free tier — you pay from the first page, unlike some rivals' monthly free quotas
×Developer/API product with no polished consumer scan-app front end
×Enters a crowded field against entrenched Google Document AI, AWS Textract, and Azure Document Intelligence
×Real-world accuracy on messy handwriting and low-quality scans isn't yet independently benchmarked

How it compares

Tool	Best for	Pricing	Score
Mistral OCR 4	—	API $4 / 1,000 pages ($2 batch); Document AI $5 / 1,000 pages	8.2/10
Notion AI vs Notion AI →	—	From $10/mo	9/10
Zapier	—	Freemium	8.9/10
Fathom vs Fathom →	—	Freemium — Free core; Team $19/user/mo	8.8/10