Connect with us

Hi, what are you looking for?

Top Stories

Mistral AI Launches OCR 3, Reducing Costs to $1 per 1,000 Pages with Enhanced Accuracy

Mistral AI unveils OCR 3, enhancing accuracy with a 74% improvement and lowering costs to $1 per 1,000 pages for high-volume processing.

Mistral AI has unveiled its latest optical character recognition service, Mistral OCR 3, designed to enhance the company’s Document AI stack. The model, designated mistral-ocr-2512, specializes in extracting interleaved text and images from PDFs and other documents while maintaining their original structure. At an aggressive pricing point of $2 per 1,000 pages, users can benefit from a 50% discount when utilizing the Batch API, significantly reducing costs for high-volume processing.

Optimized for common enterprise document workloads, Mistral OCR 3 targets a variety of document types. This includes forms, scanned documents, complex tables, and handwritten text. According to internal benchmarks based on real business scenarios, the model achieves a 74% win rate over its predecessor, Mistral OCR 2, when assessed using a fuzzy match metric against established ground truth datasets.

The new model outputs markdown that not only preserves the document layout but also enriches it with HTML-based table representations when specifically enabled. This ensures that downstream systems receive both the content and the structural information essential for retrieval pipelines, analytics, and automated workflows.

As a crucial component of Mistral Document AI, the OCR 3 model integrates seamlessly with the company’s broader document processing capabilities, which combine OCR with structured data extraction and Document QnA. This functionality is now showcased within the Document AI Playground in Mistral AI Studio, allowing users to upload PDFs or images and receive either clean text or structured JSON outputs without requiring any coding knowledge. The same underlying OCR pipeline can also be accessed via a public API, facilitating a smooth transition from exploratory use to production workloads.

The OCR processor supports multiple document formats through a unified API. Users can point the document field to various types, including document_url for PDFs and other document formats, and image_url for image formats like PNG and JPEG. The flexibility extends to uploaded or base64-encoded images and PDFs, thereby accommodating a diverse array of input types.

The response is a JSON object containing a pages array. Each page includes an index, a markdown string, a list of images, optional tables (if table_format=”html” is enabled), detected hyperlinks, and additional fields for headers and footers, if extraction is enabled. The response also includes a document_annotation field for structured annotations and a usage_info block for accounting details.

Mistral OCR 3 boasts several enhancements over Mistral OCR 2, emphasizing key improvements in four primary areas. First, the model offers better handwriting recognition, including more accurate interpretation of cursive text and annotations on printed templates. Second, it enhances forms processing by improving the detection of boxes, labels, and handwritten entries in complex layouts, which are frequently found in invoices and compliance documents. Third, it demonstrates greater robustness in handling scanned pages, overcoming challenges such as compression artifacts and low resolutions. Finally, it excels at reconstructing complex table structures with various hierarchies and can return HTML tables that maintain proper layout.

The pricing structure for Mistral OCR 3 is straightforward, with costs set at $2 per 1,000 pages for standard OCR and $3 per 1,000 pages for pages with structured annotations. When used through the Batch Inference API, the effective cost can drop to $1 per 1,000 pages, incentivizing large-scale processing. The model also integrates structured annotations and bounding box extraction features, enabling developers to label specific document regions and retrieve bounding boxes for better content mapping in downstream systems.

In summary, Mistral OCR 3 represents a significant advancement in optical character recognition technology, combining competitive pricing with enhanced capabilities. With its robust feature set, the model positions itself as a strong contender in both traditional and AI-native OCR landscapes, delivering valuable tools for document processing and analysis. As businesses increasingly seek efficiency and accuracy in document handling, Mistral’s latest offering could play a pivotal role in meeting these demands.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Tesco partners with Mistral AI to establish a three-year joint AI lab aimed at enhancing retail operations and customer interactions through generative AI solutions.

AI Tools

Mistral introduces a beta Workflow feature to enhance document processing and integrations, streamlining complex tasks for teams using its platform.

Top Stories

Mistral AI unveils its beta Workflows tool, positioning itself to disrupt LangChain's middleware dominance by offering a unified orchestration solution for enterprise AI.

Top Stories

Mistral AI launches Codestral, a 22B parameter coding model scoring 81.1% on HumanEval, challenging proprietary systems with advanced efficiency and accessibility.

Top Stories

Mistral AI opens offices in Lausanne, leveraging Switzerland's robust AI ecosystem and venture capital network to drive deep-tech innovation and funding opportunities.

Top Stories

Mistral AI co-founder linked to Meta's use of millions of pirated books, as a court ruling on fair use sets a precedent for AI...

Top Stories

Amazon Web Services streamlines deployment of Mistral's Voxtral models on SageMaker, enhancing multimodal AI with flexible integration for developers.

Top Stories

Mistral AI launches Mistral OCR 3, achieving a 74% accuracy boost over its predecessor, now available at $2 per 1,000 pages, revolutionizing document processing.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.