Connect with us

Hi, what are you looking for?

AI Generative

OpenAI Launches GPT-5.3-Codex with 57% Benchmark Score, Anthropic Responds with Claude Opus 4.6

OpenAI unveils GPT-5.3-Codex, achieving a 57% benchmark score and positioning itself against Anthropic’s Claude Opus 4.6 in the AI coding wars.

OpenAI has launched its latest coding model, GPT-5.3-Codex, claiming it to be the most advanced coding agent to date. This announcement occurred simultaneously with Anthropic’s unveiling of Claude Opus 4.6, marking the beginning of what analysts are labeling the AI coding wars, with both companies vying for dominance in the enterprise software development sector. The dual launches coincide with heightened competition, including rival Super Bowl advertisements scheduled for Sunday, as executives from both firms engage in a public exchange of criticisms regarding business ethics and operational strategies.

Shortly after the launch, OpenAI CEO Sam Altman expressed his enthusiasm on X, noting the model’s self-improving capabilities. “I love building with this model; it feels like more of a step forward than the benchmarks suggest,” he stated, indicating that GPT-5.3-Codex not only surpassed its predecessors in performance but also played a pivotal role in its own development. According to OpenAI, earlier iterations of GPT-5.3-Codex were utilized to debug training runs and manage deployment infrastructure, marking a significant milestone in AI evolution as it is described as “our first model that was instrumental in creating itself.”

In terms of performance, GPT-5.3-Codex achieved remarkable results across several industry benchmarks. It scored 57% on SWE-Bench Pro, a challenging evaluation that involves four programming languages, and secured a score of 77.3% on Terminal-Bench 2.0, which tests essential terminal skills for coding agents. The latter result reflects a substantial leap from the previous model, GPT-5.2-Codex, which scored 64.0%. OpenAI claims that this improved performance is accompanied by significant efficiency gains, using less than half the tokens of its predecessor for equivalent tasks while also delivering over 25% faster inference per token.

OpenAI has positioned GPT-5.3-Codex as more than just a coding assistant, asserting that the model can perform a wide array of tasks typically associated with software development. These capabilities include debugging, deployment, monitoring, and even drafting product requirement documents. The model also shows promise in performing various knowledge-work tasks, highlighting OpenAI’s ambition to penetrate the broader enterprise productivity software market, which includes established contenders like Microsoft and Salesforce.

In a notable development, OpenAI classified GPT-5.3-Codex as its first “high capability” model for cybersecurity-related tasks, underlining new security protocols. While Altman emphasized that there is no definitive evidence that it can automate cyber attacks completely, he stated that the firm is taking precautionary measures, including a $10 million commitment in API credits to boost cyber defense. The company is also expanding its private beta for Aardvark, its security research agent, and partnering with open-source maintainers for free codebase scanning initiatives.

The competitive landscape has intensified as OpenAI and Anthropic prepare to showcase major ads during the Super Bowl, with Altman labeling Anthropic’s campaign as “clearly dishonest.” This rivalry has escalated, with both companies scheduling their product announcements for the same time and exchanging personal jabs. Anthropic’s Claude Opus 4.6 is touted as its “smartest model,” designed for more reliable performance in complex coding tasks. In contrast, Altman has publicly criticized Anthropic’s approach, asserting that OpenAI’s user base significantly outstrips that of its competitor.

Recent survey data from Andreessen Horowitz indicates that enterprise spending on AI tools has surged dramatically, reaching an average of $7 million in 2025, significantly higher than earlier projections. Despite OpenAI’s leading market share, which is expected to decline from 62% in 2024 to 53% in 2026, Anthropic’s share is projected to grow from 14% to 18% over the same period. The survey also notes that while OpenAI leads in overall usage, a smaller percentage of its customers are utilizing its most advanced models compared to Anthropic and Google.

The financial stakes in the AI sector are immense, with both companies racing toward market dominance amid unprecedented capital requirements. Anthropic is reportedly seeking over $20 billion in a funding round, while OpenAI faces more than $1 trillion in obligations to its backers, which include tech giants like Microsoft and Nvidia. GPT-5.3-Codex was specifically trained on advanced Nvidia systems, underscoring the ongoing financial pressures to demonstrate profitability in a competitive landscape.

Looking ahead, OpenAI has made GPT-5.3-Codex available immediately for paid ChatGPT users via various platforms. The model comes equipped with an interactive feature allowing users to choose between “pragmatic” or “friendly” styles, enhancing user engagement. OpenAI plans to roll out additional capabilities, with Altman expressing confidence that Codex will thrive in the evolving AI landscape. “This time belongs to the builders, not the people who want to control them,” he concluded, emphasizing the foundational ethos driving the current race in AI coding.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Amazon's stake in Anthropic skyrockets to $60.6 billion, reflecting a seven-fold increase and solidifying its position in the AI market.

AI Regulation

Goldman Sachs partners with Anthropic to deploy Claude AI agents for accounting and compliance, enhancing efficiency in financial tasks amid rising automation interest.

Top Stories

Walmart leverages AI-driven fulfillment centers, with over 60% of U.S. stores receiving automated freight, redefining inventory strategies for speed and consumer demand.

Top Stories

Elon Musk's xAI faces UK investigation as Grok generates 99% non-consensual sexualized images of women, raising critical online safety concerns.

AI Generative

Anthropic's Claude Opus 4.6 launches with a 144 Elo point advantage over GPT-5.2, enhancing AI-driven productivity and safety for enterprise applications

AI Business

Goldman Sachs partners with Anthropic to develop AI agents aimed at automating labor-intensive banking tasks, enhancing efficiency in compliance and accounting processes.

AI Technology

OpenAI identifies five essential skills for aspiring prompt engineers, highlighting the increasing demand for expertise as AI integration expands across industries.

AI Generative

Meta tests its standalone Vibes app for AI-generated videos, aiming for a freemium model while positioning it as a major competitor to OpenAI's Sora.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.