Connect with us

Hi, what are you looking for?

Top Stories

Mistral AI Launches OCR 3, Reducing Costs to $1 per 1,000 Pages with Enhanced Accuracy

Mistral AI unveils OCR 3, enhancing accuracy with a 74% improvement and lowering costs to $1 per 1,000 pages for high-volume processing.

Mistral AI has unveiled its latest optical character recognition service, Mistral OCR 3, designed to enhance the company’s Document AI stack. The model, designated mistral-ocr-2512, specializes in extracting interleaved text and images from PDFs and other documents while maintaining their original structure. At an aggressive pricing point of $2 per 1,000 pages, users can benefit from a 50% discount when utilizing the Batch API, significantly reducing costs for high-volume processing.

Optimized for common enterprise document workloads, Mistral OCR 3 targets a variety of document types. This includes forms, scanned documents, complex tables, and handwritten text. According to internal benchmarks based on real business scenarios, the model achieves a 74% win rate over its predecessor, Mistral OCR 2, when assessed using a fuzzy match metric against established ground truth datasets.

The new model outputs markdown that not only preserves the document layout but also enriches it with HTML-based table representations when specifically enabled. This ensures that downstream systems receive both the content and the structural information essential for retrieval pipelines, analytics, and automated workflows.

As a crucial component of Mistral Document AI, the OCR 3 model integrates seamlessly with the company’s broader document processing capabilities, which combine OCR with structured data extraction and Document QnA. This functionality is now showcased within the Document AI Playground in Mistral AI Studio, allowing users to upload PDFs or images and receive either clean text or structured JSON outputs without requiring any coding knowledge. The same underlying OCR pipeline can also be accessed via a public API, facilitating a smooth transition from exploratory use to production workloads.

The OCR processor supports multiple document formats through a unified API. Users can point the document field to various types, including document_url for PDFs and other document formats, and image_url for image formats like PNG and JPEG. The flexibility extends to uploaded or base64-encoded images and PDFs, thereby accommodating a diverse array of input types.

The response is a JSON object containing a pages array. Each page includes an index, a markdown string, a list of images, optional tables (if table_format=”html” is enabled), detected hyperlinks, and additional fields for headers and footers, if extraction is enabled. The response also includes a document_annotation field for structured annotations and a usage_info block for accounting details.

Mistral OCR 3 boasts several enhancements over Mistral OCR 2, emphasizing key improvements in four primary areas. First, the model offers better handwriting recognition, including more accurate interpretation of cursive text and annotations on printed templates. Second, it enhances forms processing by improving the detection of boxes, labels, and handwritten entries in complex layouts, which are frequently found in invoices and compliance documents. Third, it demonstrates greater robustness in handling scanned pages, overcoming challenges such as compression artifacts and low resolutions. Finally, it excels at reconstructing complex table structures with various hierarchies and can return HTML tables that maintain proper layout.

The pricing structure for Mistral OCR 3 is straightforward, with costs set at $2 per 1,000 pages for standard OCR and $3 per 1,000 pages for pages with structured annotations. When used through the Batch Inference API, the effective cost can drop to $1 per 1,000 pages, incentivizing large-scale processing. The model also integrates structured annotations and bounding box extraction features, enabling developers to label specific document regions and retrieve bounding boxes for better content mapping in downstream systems.

In summary, Mistral OCR 3 represents a significant advancement in optical character recognition technology, combining competitive pricing with enhanced capabilities. With its robust feature set, the model positions itself as a strong contender in both traditional and AI-native OCR landscapes, delivering valuable tools for document processing and analysis. As businesses increasingly seek efficiency and accuracy in document handling, Mistral’s latest offering could play a pivotal role in meeting these demands.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Mistral AI commits €1.2B to build Nordic data centers, boosting Europe's A.I. autonomy and positioning itself as a rival to OpenAI and Microsoft.

Top Stories

Mistral AI and EcoDataCenter announce a €1.2 billion investment in a Swedish AI data center, set to enhance Europe's technological autonomy by 2027.

Top Stories

Mistral AI invests €1.2 billion in a Sweden-based AI data center, aiming for European digital sovereignty and local data processing by 2027.

Top Stories

Mistral AI invests €1.2 billion in a new Swedish data center to enhance European AI infrastructure and bolster regional digital sovereignty by 2027

Top Stories

Global tech leaders, including Sundar Pichai and Sam Altman, will converge at India’s first AI Impact Summit, set for February 16-20, to shape global...

Top Stories

Mistral AI unveils Voxtral Transcribe 2, delivering real-time transcription with under 200ms latency for just $0.006 per minute, revolutionizing speech-to-text technology.

AI Technology

Mistral AI partners with industry leaders like Cisco and Stellantis to unveil four essential criteria for selecting impactful enterprise AI use cases, ensuring rapid...

Top Stories

Mistral AI targets over €1 billion in revenue by year-end, driven by licensing and subscriptions, following a strong €1.7 billion funding round.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.