Top Stories

Mistral Launches AI Family with 675B Parameters, Competes Head-to-Head with DeepSeek

French startup Mistral launches four AI models, including the flagship Large 3 with 675 billion parameters, challenging DeepSeek’s dominance in the open-source arena.

French AI startup Mistral made a significant move in the competitive landscape of artificial intelligence with the release of its latest model family on Tuesday. Known as the underdog in a field largely dominated by American and Chinese firms, Mistral is positioning its new models, released free of charge, to challenge the leading open-source alternatives.

The new lineup includes four models, ranging from compact personal assistants to a cutting-edge system boasting 675 billion parameters. All models are available under the permissive Apache 2.0 open-source license, allowing users to download, modify, and fine-tune them for various applications on compatible hardware.

At the forefront is the flagship model, Mistral Large 3, which uses a sparse Mixture-of-Experts architecture that activates only 41 billion of its 675 billion parameters for each token processed. This engineering choice lets the model approach the quality of much larger dense systems while incurring roughly the inference cost of a model the size of its active parameter count.
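Mistral has not published Large 3's routing details, but the general top-k Mixture-of-Experts idea the paragraph describes can be sketched in a few lines: a router scores all experts per token, and only the top-k actually run, so compute scales with the active parameters rather than the total. This is a toy illustration with made-up dimensions, not Mistral's implementation.

```python
import numpy as np

def moe_layer(x, experts, gate_w, top_k=2):
    """Toy sparse Mixture-of-Experts layer.

    x: (d,) token embedding; experts: list of (d, d) weight matrices;
    gate_w: (n_experts, d) router weights. Only top_k experts run per
    token, so compute grows with top_k, not the total expert count.
    """
    scores = gate_w @ x                    # one router logit per expert
    top = np.argsort(scores)[-top_k:]      # indices of the top-k experts
    weights = np.exp(scores[top])
    weights /= weights.sum()               # softmax over the selected experts
    # Weighted sum of the chosen experts' outputs; the rest stay idle.
    return sum(w * (experts[i] @ x) for w, i in zip(weights, top))

rng = np.random.default_rng(0)
d, n_experts = 8, 16
experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]
gate_w = rng.standard_normal((n_experts, d))
y = moe_layer(rng.standard_normal(d), experts, gate_w, top_k=2)
print(y.shape)  # (8,)
```

With 16 experts and top_k=2, each token touches only an eighth of the expert weights, which is the same principle that lets Large 3 process tokens with 41B of its 675B parameters.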

Trained from scratch using 3,000 NVIDIA H200 GPUs, Mistral Large 3 debuted impressively, ranking second among open-source, non-reasoning models on the LMArena leaderboard. In terms of benchmark comparisons, Mistral’s leading model surpasses DeepSeek V3.1 across several metrics but trails slightly behind the newer V3.2 version.

When it comes to general knowledge and expert reasoning tasks, Mistral's offerings hold their ground, although DeepSeek maintains an edge in coding speed and mathematical logic. Notably, this release includes no dedicated reasoning models, leaving it behind competitors on tasks that reward explicit step-by-step reasoning.

The smaller models in the lineup, referred to as “Ministral,” are particularly noteworthy for developers. Available in three sizes—3 billion, 8 billion, and 14 billion parameters—these models come with both base and instruct variants and support native vision input. The 3B model has garnered attention from AI researcher Simon Willison, who highlighted its capability to run entirely within a browser using WebGPU.

This capability offers unique opportunities for developers and hobbyists alike, making the models suitable for applications in drones, robots, and even offline in-vehicle systems. Early testing has revealed a distinctive character across the Mistral lineup; Mistral Large 3 demonstrates conversational fluency, often mirroring the style of GPT-5 but with a more natural cadence.

However, testers have also flagged a tendency toward repetition and overreliance on stock phrases, especially in the 14B Ministral instruct variant, with complaints surfacing on platforms like Reddit. Despite these issues, the models' ability to generate long-form content remains a highlight for their size.

The smaller 3B and 8B models, while functional, sometimes produce formulaic outputs on creative tasks, although their compact size allows them to run on less powerful hardware, such as smartphones. The only other competitive option in this niche is Google’s smallest version of Gemma 3.

Enterprise interest in Mistral is already materializing, as demonstrated by HSBC's announcement of a multi-year partnership to implement generative AI within its operations. The bank plans to self-host the models on its infrastructure, aligning Mistral's expertise with its internal technical capabilities—a choice particularly appealing for organizations managing sensitive customer data.

In collaboration with NVIDIA, Mistral has released a checkpoint quantized to NVFP4, NVIDIA's 4-bit floating-point format, which allows Mistral Large 3 to run on a single node of eight high-end NVIDIA GPUs. NVIDIA claims the Ministral 3B model achieves roughly 385 tokens per second on an RTX 5090 and around 50 tokens per second on Jetson Thor for robotics applications.

Future developments include a reasoning-optimized version of Large 3, although competitors like DeepSeek R1 and various Chinese models retain their advantages in explicit reasoning tasks for now. For enterprises prioritizing cutting-edge capabilities, open-source flexibility, multilingual support, and compliance with European regulations, Mistral’s emergence marks a pivotal expansion of options in the AI landscape.

Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved.