DeepSeek Launches V4, Surpassing GPT-5 and Claude in Key AI Benchmarks

DeepSeek’s V4-Pro eclipses GPT-5 and Claude in key benchmarks, achieving a Codeforces rating of 3,206 while undercutting OpenAI’s costs by 89% per million tokens.

Staff

Published

27 April, 2026

China’s DeepSeek has released its new V4 model family, showcased at a recent technology event in Silicon Valley. The Hangzhou-based company is drawing attention because V4 outperforms several well-known American models on specific benchmarks, signaling a potential shift in the global AI arena.

DeepSeek launched two models: V4-Pro, aimed at expert users, with 1.6 trillion parameters, and V4-Flash, a more accessible model with 284 billion parameters. Both feature a one-million-token context window, a significant enhancement for handling long and complex inputs. Both are also open source and available for download via Hugging Face, so users can deploy them locally, although V4-Pro requires substantial VRAM for optimal performance.
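To give a sense of what "substantial VRAM" could mean, here is a rough back-of-the-envelope sketch of the memory needed just to hold the weights. The parameter counts come from the article. The storage precisions are assumptions, since the article does not say which format the released weights use, and the estimate ignores the KV cache, activations, and runtime overhead.

```python
# Rough weight-only memory estimate. It ignores the KV cache, activations,
# and framework overhead, so real requirements will be higher.

# Parameter counts as reported in the article.
PARAMS = {"V4-Pro": 1.6e12, "V4-Flash": 284e9}

# Bytes per parameter for common storage precisions. These are assumptions:
# the article does not state the precision of the released weights.
BYTES_PER_PARAM = {"fp16/bf16": 2.0, "fp8/int8": 1.0, "4-bit": 0.5}


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Return the memory needed to store the weights, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    for model, num_params in PARAMS.items():
        for precision, nbytes in BYTES_PER_PARAM.items():
            gb = weight_memory_gb(num_params, nbytes)
            print(f"{model:9s} {precision:10s} ~{gb:,.0f} GB")
```

Under these assumptions, V4-Pro's weights alone would occupy about 3,200 GB at 16-bit precision and about 800 GB at 4-bit, while V4-Flash would need about 568 GB and 142 GB respectively. Either way, V4-Pro is well beyond a single consumer GPU.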

In technical assessments, V4-Pro has performed strongly, particularly in coding tasks. It achieved a Codeforces rating of 3,206, surpassing GPT-5.4’s 3,168 and Gemini 3.1’s 3,052, thereby establishing itself as the leading open model for competitive programming. On LiveCodeBench, it scored 93.5, ahead of Claude Opus 4.6’s 88.8 and Gemini’s 91.7. It showed a similar lead in agentic tasks, scoring 51.8 on Toolathlon against Claude’s 47.2 and Gemini’s 48.8. The faster V4-Flash model competes effectively on simpler tasks, providing a cost-efficient alternative without sacrificing performance.

Despite these successes, V4-Pro trails its rivals in some areas. Claude Opus 4.6 leads in long-context retrieval, scoring 92.9 on MRCR 1M against V4-Pro’s 83.5. GPT-5.4 also maintains an advantage on Terminal Bench 2.0, scoring 75.1 against V4-Pro’s 67.9. Nevertheless, DeepSeek’s pricing could reshape customer choices: V4-Pro costs $3.48 per million output tokens, a striking contrast to OpenAI’s $30 and Anthropic’s $25 for similar workloads.
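The size of that gap is easy to work out from the quoted output prices. The sketch below is illustrative only. The per-million prices come from the article, while the monthly token volume is a hypothetical workload chosen for the example, and input-token prices are not considered because the article does not give them.

```python
# Output-token prices in USD per million tokens, as quoted in the article.
PRICE_PER_M_OUTPUT = {
    "DeepSeek V4-Pro": 3.48,
    "OpenAI": 30.00,
    "Anthropic": 25.00,
}

# Hypothetical workload, not from the article: 500 million output tokens a month.
MONTHLY_OUTPUT_TOKENS = 500_000_000


def output_cost(price_per_million: float, output_tokens: int) -> float:
    """Cost in USD of generating `output_tokens` at the given per-million price."""
    return price_per_million * output_tokens / 1_000_000


def savings_pct(cheaper: float, pricier: float) -> float:
    """Percentage by which `cheaper` undercuts `pricier`."""
    return (1 - cheaper / pricier) * 100


if __name__ == "__main__":
    deepseek = PRICE_PER_M_OUTPUT["DeepSeek V4-Pro"]
    for vendor, price in PRICE_PER_M_OUTPUT.items():
        cost = output_cost(price, MONTHLY_OUTPUT_TOKENS)
        line = f"{vendor:16s} ${cost:>10,.2f} per month"
        if vendor != "DeepSeek V4-Pro":
            line += f"  (V4-Pro is {savings_pct(deepseek, price):.1f}% cheaper)"
        print(line)
```

On output prices alone, V4-Pro comes out roughly 88% below OpenAI and 86% below Anthropic. For the hypothetical 500-million-token workload, that is about $1,740 a month against $15,000 and $12,500.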

This pricing advantage may attract developers looking to incorporate AI capabilities into their applications, as the financial barrier to entry remains a crucial consideration in the burgeoning AI market. The disparity in costs could position DeepSeek as a compelling alternative for businesses and developers eager to leverage advanced AI without incurring prohibitive expenses.

As AI technology continues to evolve rapidly, DeepSeek’s advancements underscore the potential for increased competition in a space traditionally dominated by American firms. The release of the V4 models not only highlights DeepSeek’s growing influence but also signals a broader trend of innovation and diversity within the global AI landscape. With these developments, the stage is set for further advancements as companies strive to meet the escalating demands of AI applications across various sectors.


AIPRESSA.COM
