DeepSeek Launches V4, Surpassing GPT-5 and Claude in Key AI Benchmarks

DeepSeek’s V4-Pro eclipses GPT-5 and Claude in key benchmarks, achieving a Codeforces rating of 3,206 while undercutting OpenAI’s price per million output tokens by roughly 88%.

By Staff | Published 3 hours ago

China’s DeepSeek has entered the competitive AI landscape with its new V4 model family, showcased at a recent technology event in Silicon Valley. The Hangzhou-based company is drawing attention for outperforming several well-known American models on specific benchmarks, a sign of a potential shift in the global AI arena.

DeepSeek launched two models: V4-Pro, a 1.6-trillion-parameter model aimed at expert users, and V4-Flash, a lighter 284-billion-parameter model. Both have a one-million-token context window, which helps with long and complex inputs. Both are also open source and available for download from Hugging Face, so users can deploy them locally, although V4-Pro requires substantial VRAM for optimal performance.
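To put "substantial VRAM" in perspective, the weight footprint can be estimated from the parameter counts alone. The sketch below is a back-of-the-envelope calculation, not a statement of DeepSeek's actual hardware requirements: the numeric precisions listed are assumptions (the article does not say which formats the released checkpoints use), and the estimate covers weights only, ignoring the KV cache, activations, and runtime overhead.

```python
# Parameter counts as reported in the article.
PARAMS = {
    "V4-Pro": 1.6e12,    # 1.6 trillion
    "V4-Flash": 284e9,   # 284 billion
}

# Bytes per parameter for some common precisions.
# ASSUMPTION: these formats are illustrative; the article does not state
# which precision the released weights actually use.
BYTES_PER_PARAM = {
    "16-bit": 2.0,
    "8-bit": 1.0,
    "4-bit": 0.5,
}


def weight_memory_gb(n_params: float, bytes_per_param: float) -> float:
    """Approximate memory needed to hold the weights, in gigabytes (1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


if __name__ == "__main__":
    for model, n_params in PARAMS.items():
        for precision, width in BYTES_PER_PARAM.items():
            gb = weight_memory_gb(n_params, width)
            print(f"{model:9s} {precision:6s} ~{gb:,.0f} GB of weights")
```

Under these assumptions, V4-Pro's weights alone would occupy about 3,200 GB at 16-bit precision and about 800 GB at 4-bit, while V4-Flash would need roughly 568 GB and 142 GB respectively.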

In technical assessments, V4-Pro performed especially well on coding tasks. It achieved a Codeforces rating of 3,206, ahead of GPT-5.4’s 3,168 and Gemini 3.1’s 3,052, making it the leading open model for competitive programming. On LiveCodeBench it scored 93.5, edging out Claude Opus 4.6’s 88.8 and Gemini’s 91.7. The pattern held in agentic tasks, where V4-Pro scored 51.8 on Toolathlon against Claude’s 47.2 and Gemini’s 48.8. The faster V4-Flash competes effectively on simpler tasks, offering a cost-efficient alternative without sacrificing performance on those tasks.

Despite these successes, V4-Pro trails its rivals in some areas. Claude Opus 4.6 leads in long-context retrieval, scoring 92.9 on MRCR 1M to V4-Pro’s 83.5. GPT-5.4 also holds an advantage on Terminal Bench 2.0, scoring 75.1 against V4-Pro’s 67.9. Nevertheless, DeepSeek’s pricing could reshape customer choices: V4-Pro costs $3.48 per million output tokens, compared with OpenAI’s $30 and Anthropic’s $25 for similar workloads.

This pricing advantage may attract developers looking to build AI capabilities into their applications, since cost remains a major barrier to entry in the AI market. The gap could make DeepSeek a compelling alternative for businesses and developers that want advanced AI without prohibitive expense.
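The size of the price gap can be checked directly from the quoted figures. The short sketch below uses only the three output-token prices reported above; the monthly token volume is a hypothetical workload chosen for illustration, and input-token pricing is left out because the article does not report it.

```python
# Output-token prices in USD per million tokens, as quoted in the article.
PRICES = {
    "DeepSeek V4-Pro": 3.48,
    "OpenAI": 30.00,
    "Anthropic": 25.00,
}


def savings_pct(cheaper: float, pricier: float) -> float:
    """Percentage by which the cheaper price undercuts the pricier one."""
    return (1 - cheaper / pricier) * 100


def output_cost(tokens: int, price_per_million: float) -> float:
    """Cost in USD of generating `tokens` output tokens at the given price."""
    return tokens / 1_000_000 * price_per_million


if __name__ == "__main__":
    deepseek = PRICES["DeepSeek V4-Pro"]
    for vendor in ("OpenAI", "Anthropic"):
        pct = savings_pct(deepseek, PRICES[vendor])
        print(f"V4-Pro vs {vendor}: {pct:.1f}% cheaper per output token")

    # ASSUMPTION: a hypothetical workload of 500 million output tokens a month.
    monthly_tokens = 500_000_000
    for vendor, price in PRICES.items():
        print(f"{vendor}: ${output_cost(monthly_tokens, price):,.2f} per month")
```

On these numbers V4-Pro comes out about 88.4% cheaper than OpenAI and about 86.1% cheaper than Anthropic per output token. For the hypothetical 500-million-token month, that is $1,740 against $15,000 and $12,500.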

As AI technology continues to evolve rapidly, DeepSeek’s advancements underscore the potential for increased competition in a space traditionally dominated by American firms. The release of the V4 models not only highlights DeepSeek’s growing influence but also signals a broader trend of innovation and diversity within the global AI landscape. With these developments, the stage is set for further advancements as companies strive to meet the escalating demands of AI applications across various sectors.
