Chinese AI lab DeepSeek has launched two preview versions of its newest large language model, DeepSeek V4, a long-awaited successor to last year’s V3.2 model and the accompanying R1 reasoning model that took the AI world by storm.
The company announced that both DeepSeek V4 Flash and V4 Pro are mixture-of-experts models featuring context windows of 1 million tokens each, allowing for the integration of extensive codebases or documents into prompts. This mixture-of-experts approach activates only a selected number of parameters per task, which helps to reduce inference costs.
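The routing idea behind mixture-of-experts can be sketched in a few lines. This is an illustrative toy, not DeepSeek’s actual architecture: a router scores every expert for each token, only the top-k experts are activated, and their softmax-normalized scores weight the outputs, so most of the model’s parameters stay idle on any given token.

```python
import math
import random

random.seed(0)

NUM_EXPERTS = 8  # toy expert count (illustrative, not DeepSeek's)
TOP_K = 2        # experts actually activated per token

def route(scores, k=TOP_K):
    """Pick the k highest-scoring experts and softmax their scores.

    Returns a dict mapping expert index -> mixing weight; the other
    NUM_EXPERTS - k experts contribute nothing for this token.
    """
    top = sorted(range(len(scores)), key=scores.__getitem__)[-k:]
    exps = [math.exp(scores[i]) for i in top]
    total = sum(exps)
    return {i: e / total for i, e in zip(top, exps)}

# One token's router scores over the experts.
scores = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
weights = route(scores)
print(len(weights))                  # 2 experts active out of 8
print(sum(weights.values()))         # weights are normalized (sums to 1.0)
```

With 2 of 8 experts active, only a quarter of the expert parameters run per token; the same principle is how a 1.6-trillion-parameter model can activate only 49 billion.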
The Pro model boasts a total of 1.6 trillion parameters, with 49 billion active, making it the largest open-weight model currently available. This surpasses rivals such as Moonshot AI’s Kimi K2.6 (1.1 trillion parameters) and MiniMax’s M1 (456 billion), and is more than double the 671 billion parameters of DeepSeek’s own V3.2. The smaller V4 Flash model contains 284 billion parameters, with 13 billion active.
DeepSeek asserts that both V4 models are more efficient and higher-performing than V3.2 thanks to architectural enhancements, claiming they have nearly “closed the gap” with the leading models, open or closed, on reasoning benchmarks. The company maintains that its new V4-Pro-Max model surpasses its open-source competitors across reasoning benchmarks, even outpacing OpenAI’s GPT-5.2 and Gemini 3.0 Pro on certain tasks. On coding competition benchmarks, DeepSeek claims the performance of both V4 models is “comparable to GPT-5.4.”
Despite these advancements, the models appear to lag behind frontier models in knowledge tests, particularly OpenAI’s GPT-5.4 and Google’s Gemini 3.1 Pro. This discrepancy suggests a “developmental trajectory that trails state-of-the-art frontier models by approximately 3 to 6 months,” according to the lab.
Both V4 Flash and V4 Pro are limited to text-only capabilities, contrasting with many closed-source competitors that offer multi-modal functionalities, including audio, video, and image processing.
In terms of pricing, DeepSeek V4 is significantly more affordable than existing frontier models. The smaller V4 Flash is priced at $0.14 per million input tokens and $0.28 per million output tokens, undercutting GPT-5.4 Nano, Gemini 3.1 Flash, GPT-5.4 Mini, and Claude Haiku 4.5. The larger V4 Pro is priced at $0.145 per million input tokens and $3.48 per million output tokens, also undercutting competitors such as Gemini 3.1 Pro, GPT-5.5, Claude Opus 4.7, and GPT-5.4.
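For a sense of what those list prices mean in practice, here is a back-of-envelope cost calculation using only the DeepSeek figures quoted above (the hypothetical request size is ours, not the company’s):

```python
# Per-million-token list prices quoted for the two V4 preview models:
# (input $/1M tokens, output $/1M tokens)
PRICES = {
    "V4 Flash": (0.14, 0.28),
    "V4 Pro": (0.145, 3.48),
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request at the quoted list prices."""
    in_rate, out_rate = PRICES[model]
    return in_rate * input_tokens / 1e6 + out_rate * output_tokens / 1e6

# Example: filling the full 1M-token context window and generating
# a 10,000-token response.
for model in PRICES:
    print(model, round(request_cost(model, 1_000_000, 10_000), 4))
# V4 Flash 0.1428
# V4 Pro 0.1798
```

Even a maximal 1M-token prompt costs well under twenty cents on either model, which is where the undercutting claim bites.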
The announcement came a day after the U.S. government accused China of large-scale theft of American AI intellectual property through numerous proxy accounts. DeepSeek itself faces allegations from Anthropic and OpenAI over “distillation,” or copying, of their AI models.
With the V4 models, DeepSeek stakes out a significant position in the rapidly evolving AI sector, where the race for dominance is intensifying amid intellectual property disputes and fast-advancing capabilities.



















































