AI Generative

OpenAI Launches GPT-5.3-Codex with 57% Benchmark Score, Anthropic Responds with Claude Opus 4.6

OpenAI unveils GPT-5.3-Codex, achieving a 57% benchmark score and positioning itself against Anthropic’s Claude Opus 4.6 in the AI coding wars.

Staff

Published

1 hour ago

OpenAI has launched its latest coding model, GPT-5.3-Codex, claiming it to be the most advanced coding agent to date. This announcement occurred simultaneously with Anthropic’s unveiling of Claude Opus 4.6, marking the beginning of what analysts are labeling the AI coding wars, with both companies vying for dominance in the enterprise software development sector. The dual launches coincide with heightened competition, including rival Super Bowl advertisements scheduled for Sunday, as executives from both firms engage in a public exchange of criticisms regarding business ethics and operational strategies.

Shortly after the launch, OpenAI CEO Sam Altman expressed his enthusiasm on X, noting the model’s self-improving capabilities. “I love building with this model; it feels like more of a step forward than the benchmarks suggest,” he stated, indicating that GPT-5.3-Codex not only surpassed its predecessors in performance but also played a pivotal role in its own development. According to OpenAI, earlier iterations of GPT-5.3-Codex were utilized to debug training runs and manage deployment infrastructure, marking a significant milestone in AI evolution as it is described as “our first model that was instrumental in creating itself.”

In terms of performance, GPT-5.3-Codex achieved remarkable results across several industry benchmarks. It scored 57% on SWE-Bench Pro, a challenging evaluation that involves four programming languages, and secured a score of 77.3% on Terminal-Bench 2.0, which tests essential terminal skills for coding agents. The latter result reflects a substantial leap from the previous model, GPT-5.2-Codex, which scored 64.0%. OpenAI claims that this improved performance is accompanied by significant efficiency gains, using less than half the tokens of its predecessor for equivalent tasks while also delivering over 25% faster inference per token.

OpenAI has positioned GPT-5.3-Codex as more than just a coding assistant, asserting that the model can perform a wide array of tasks typically associated with software development. These capabilities include debugging, deployment, monitoring, and even drafting product requirement documents. The model also shows promise in performing various knowledge-work tasks, highlighting OpenAI’s ambition to penetrate the broader enterprise productivity software market, which includes established contenders like Microsoft and Salesforce.

In a notable development, OpenAI classified GPT-5.3-Codex as its first “high capability” model for cybersecurity-related tasks, underlining new security protocols. While Altman emphasized that there is no definitive evidence that it can automate cyber attacks completely, he stated that the firm is taking precautionary measures, including a $10 million commitment in API credits to boost cyber defense. The company is also expanding its private beta for Aardvark, its security research agent, and partnering with open-source maintainers for free codebase scanning initiatives.

The competitive landscape has intensified as OpenAI and Anthropic prepare to showcase major ads during the Super Bowl, with Altman labeling Anthropic’s campaign as “clearly dishonest.” This rivalry has escalated, with both companies scheduling their product announcements for the same time and exchanging personal jabs. Anthropic’s Claude Opus 4.6 is touted as its “smartest model,” designed for more reliable performance in complex coding tasks. In contrast, Altman has publicly criticized Anthropic’s approach, asserting that OpenAI’s user base significantly outstrips that of its competitor.

Recent survey data from Andreessen Horowitz indicates that enterprise spending on AI tools has surged dramatically, reaching an average of $7 million in 2025, significantly higher than earlier projections. Despite OpenAI’s leading market share, which is expected to decline from 62% in 2024 to 53% in 2026, Anthropic’s share is projected to grow from 14% to 18% over the same period. The survey also notes that while OpenAI leads in overall usage, a smaller percentage of its customers are utilizing its most advanced models compared to Anthropic and Google.

The financial stakes in the AI sector are immense, with both companies racing toward market dominance amid unprecedented capital requirements. Anthropic is reportedly seeking over $20 billion in a funding round, while OpenAI faces more than $1 trillion in obligations to its backers, which include tech giants like Microsoft and Nvidia. GPT-5.3-Codex was specifically trained on advanced Nvidia systems, underscoring the ongoing financial pressures to demonstrate profitability in a competitive landscape.

Looking ahead, OpenAI has made GPT-5.3-Codex available immediately for paid ChatGPT users via various platforms. The model comes equipped with an interactive feature allowing users to choose between “pragmatic” or “friendly” styles, enhancing user engagement. OpenAI plans to roll out additional capabilities, with Altman expressing confidence that Codex will thrive in the evolving AI landscape. “This time belongs to the builders, not the people who want to control them,” he concluded, emphasizing the foundational ethos driving the current race in AI coding.

Amazon’s Anthropic Stake Surges to $60.6 Billion, Boosting AI Investment Value Sevenfold

Amazon's stake in Anthropic skyrockets to $60.6 billion, reflecting a seven-fold increase and solidifying its position in the AI market.

Staff1 hour ago

AI Regulation

Goldman Sachs Deploys Anthropic’s Claude AI for Core Accounting and Compliance Functions

Goldman Sachs partners with Anthropic to deploy Claude AI agents for accounting and compliance, enhancing efficiency in financial tasks amid rising automation interest.

Staff2 hours ago

AI Redefines Inventory Strategies: Retailers Must Adapt to Consumer Demand for Speed

Walmart leverages AI-driven fulfillment centers, with over 60% of U.S. stores receiving automated freight, redefining inventory strategies for speed and consumer demand.

Staff5 hours ago

Grok’s Misuse Sparks UK Investigation Amid Surge in Non-Consensual AI Deepfakes

Elon Musk's xAI faces UK investigation as Grok generates 99% non-consensual sexualized images of women, raising critical online safety concerns.

Staff5 hours ago

AI Generative

Anthropic Launches Claude Opus 4.6, Achieving 144 Elo Point Lead Over GPT-5.2

Anthropic's Claude Opus 4.6 launches with a 144 Elo point advantage over GPT-5.2, enhancing AI-driven productivity and safety for enterprise applications

Staff6 hours ago

AI Business

Goldman Sachs Partners with Anthropic to Launch AI Agents for Banking Automation

Goldman Sachs partners with Anthropic to develop AI agents aimed at automating labor-intensive banking tasks, enhancing efficiency in compliance and accounting processes.

Marcus Chen8 hours ago

AI Technology

OpenAI Reveals 5 Essential Skills for Aspiring Prompt Engineers in AI Industry

OpenAI identifies five essential skills for aspiring prompt engineers, highlighting the increasing demand for expertise as AI integration expands across industries.

Staff9 hours ago

AI Generative

Meta Launches Standalone Vibes App for AI-Generated Videos, Testing Freemium Model

Meta tests its standalone Vibes app for AI-generated videos, aiming for a freemium model while positioning it as a major competitor to OpenAI's Sora.

Staff10 hours ago

AIPRESSA.COM

AI Generative

OpenAI Launches GPT-5.3-Codex with 57% Benchmark Score, Anthropic Responds with Claude Opus 4.6

Trending

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

You May Also Like

Top Stories

Amazon’s Anthropic Stake Surges to $60.6 Billion, Boosting AI Investment Value Sevenfold

AI Regulation

Goldman Sachs Deploys Anthropic’s Claude AI for Core Accounting and Compliance Functions

Top Stories

AI Redefines Inventory Strategies: Retailers Must Adapt to Consumer Demand for Speed

Top Stories

Grok’s Misuse Sparks UK Investigation Amid Surge in Non-Consensual AI Deepfakes

AI Generative

Anthropic Launches Claude Opus 4.6, Achieving 144 Elo Point Lead Over GPT-5.2

AI Business

Goldman Sachs Partners with Anthropic to Launch AI Agents for Banking Automation

AI Technology

OpenAI Reveals 5 Essential Skills for Aspiring Prompt Engineers in AI Industry

AI Generative

Meta Launches Standalone Vibes App for AI-Generated Videos, Testing Freemium Model