DeepSeek Launches V4 Models with 1.6 Trillion Parameters, Closing Gap with Leading AI

DeepSeek has launched its V4 models, the larger of which has 1.6 trillion parameters, undercutting rivals on price while claiming gains in efficiency and reasoning performance.

Staff

Published 24 April, 2026

Chinese AI lab DeepSeek has launched two preview versions of its newest large language model, DeepSeek V4, a much-awaited update to last year’s V3.2 model and the accompanying R1 reasoning model that took the AI world by storm.

The company announced that both DeepSeek V4 Flash and V4 Pro are mixture-of-experts models with context windows of 1 million tokens each, large enough to fit extensive codebases or documents into a single prompt. A mixture-of-experts model activates only a subset of its parameters for each input token rather than the full network, which helps to reduce inference costs.
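For readers who want to see the routing idea concretely, the following is a minimal toy sketch of top-k expert selection, the general technique behind mixture-of-experts models. It is not DeepSeek's architecture: the eight scalar "experts", the gate scores, and the function names `top_k_route` and `moe_forward` are all hypothetical, chosen only for illustration.

```python
import math

def top_k_route(gate_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    top = sorted(range(len(gate_scores)),
                 key=lambda i: gate_scores[i], reverse=True)[:k]
    # Softmax over only the selected experts so their weights sum to 1.
    peak = max(gate_scores[i] for i in top)
    exps = [math.exp(gate_scores[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

def moe_forward(x, experts, gate_scores, k=2):
    """Combine the outputs of the selected experts; the others are never run."""
    return sum(w * experts[i](x) for i, w in top_k_route(gate_scores, k))

# Hypothetical toy setup: 8 "experts", each just a scalar multiplier (1x to 8x).
experts = [lambda x, s=s: s * x for s in range(1, 9)]
# Hypothetical gate scores for one token; a real model computes these from the input.
gate_scores = [0.1, 2.0, -1.0, 0.3, 1.5, 0.0, -0.5, 0.2]

routed = top_k_route(gate_scores, k=2)
output = moe_forward(3.0, experts, gate_scores, k=2)
print(routed, output)
```

Only two of the eight toy experts run for this input, which is the source of the inference savings: compute scales with the experts selected, not with the total number available.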

The Pro model has a total of 1.6 trillion parameters, of which 49 billion are active, making it the largest open-weight model currently available. It surpasses rivals such as Moonshot AI’s Kimi K2.6, with 1.1 trillion parameters, and MiniMax’s M1, with 456 billion, and it more than doubles the 671 billion parameters of DeepSeek V3.2. The smaller V4 Flash model contains 284 billion parameters, of which 13 billion are active.
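To put those parameter counts in proportion, here is a short sketch that computes the share of each model's parameters that is active, along with V4 Pro's size relative to V3.2. The figures are the ones reported in this article; the helper name `active_fraction` is hypothetical.

```python
# Parameter counts as reported in this article: (total, active).
MODELS = {
    "V4 Pro": (1.6e12, 49e9),
    "V4 Flash": (284e9, 13e9),
}
V3_2_TOTAL = 671e9  # DeepSeek V3.2 total parameters, as reported

def active_fraction(total, active):
    """Share of a model's parameters that is active at once."""
    return active / total

for name, (total, active) in MODELS.items():
    print(f"{name}: {active_fraction(total, active):.1%} of parameters active")

pro_vs_v32 = MODELS["V4 Pro"][0] / V3_2_TOTAL
print(f"V4 Pro is {pro_vs_v32:.2f}x the size of V3.2")
```

On these figures, roughly 3.1% of V4 Pro's parameters and 4.6% of V4 Flash's are active, and V4 Pro is about 2.38 times the size of V3.2.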

DeepSeek asserts that both V4 models are more efficient and higher-performing than V3.2 thanks to architectural improvements, and claims they have nearly “closed the gap” with the leading models, whether open or closed, on reasoning benchmarks. The company says its new V4-Pro-Max model surpasses its open-source competitors across reasoning benchmarks and even outpaces OpenAI’s GPT-5.2 and Gemini 3.0 Pro on certain tasks. On coding competition benchmarks, DeepSeek claims the performance of both V4 models is “comparable to GPT-5.4.”

Despite these advancements, the models appear to lag behind frontier models in knowledge tests, particularly OpenAI’s GPT-5.4 and Google’s Gemini 3.1 Pro. This discrepancy suggests a “developmental trajectory that trails state-of-the-art frontier models by approximately 3 to 6 months,” according to the lab.

Both V4 Flash and V4 Pro are text-only, unlike many closed-source competitors that offer multimodal capabilities such as audio, video, and image processing.

On pricing, DeepSeek V4 is significantly cheaper than existing frontier models. The smaller V4 Flash is priced at $0.14 per million input tokens and $0.28 per million output tokens, undercutting GPT-5.4 Nano, Gemini 3.1 Flash, GPT-5.4 Mini, and Claude Haiku 4.5. The larger V4 Pro costs $0.145 per million input tokens and $3.48 per million output tokens, also undercutting competitors such as Gemini 3.1 Pro, GPT-5.5, Claude Opus 4.7, and GPT-5.4.
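As a rough illustration of what per-million-token pricing means for a single request, the sketch below estimates the cost of a prompt that fills the 1-million-token context window and returns a 10,000-token answer. The prices are the ones quoted in this article; the workload and the function name `request_cost` are hypothetical, and real bills may differ with caching or other pricing tiers not covered here.

```python
# Prices in US dollars per million tokens, as quoted in this article.
PRICES = {
    "V4 Flash": {"input": 0.14, "output": 0.28},
    "V4 Pro": {"input": 0.145, "output": 3.48},
}

def request_cost(model, input_tokens, output_tokens, prices=PRICES):
    """Estimated cost in dollars of one request at the quoted per-token rates."""
    rate = prices[model]
    return (input_tokens * rate["input"]
            + output_tokens * rate["output"]) / 1_000_000

# Hypothetical workload: a full 1M-token prompt and a 10,000-token reply.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 1_000_000, 10_000):.4f}")
```

At the quoted rates this workload comes to about $0.14 on V4 Flash and about $0.18 on V4 Pro, with the output price mattering more as replies get longer.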

The announcement comes a day after the U.S. government accused China of large-scale theft of American AI intellectual property through numerous proxy accounts. DeepSeek itself faces allegations from Anthropic and OpenAI of “distillation,” or copying, of their AI models.

DeepSeek’s V4 models arrive in a crowded market, where their low prices and open weights will be weighed against the knowledge-benchmark gap the lab itself acknowledges and the unresolved allegations of intellectual property theft.
