AI Generative

Anthropic Launches Claude Opus 4.6, Achieving 144 Elo Point Lead Over GPT-5.2

Anthropic’s Claude Opus 4.6 launches with a 144 Elo point advantage over GPT-5.2, enhancing AI-driven productivity and safety for enterprise applications

Staff

Published

6 February, 2026

Anthropic has unveiled its latest AI model, Claude Opus 4.6, which boasts significant enhancements over its predecessor. Announced on October 3, 2023, the model features improved coding competencies, a larger context window of 1 million tokens in beta, and enhanced capabilities for executing complex tasks autonomously. This model is designed to assist users in various everyday tasks, including financial analysis, research, and document creation, thereby elevating productivity in workplace environments.

Claude Opus 4.6 has demonstrated exceptional performance across multiple evaluations. It achieved the highest score on the Terminal-Bench 2.0 coding evaluation and surpassed other models in Humanity’s Last Exam, a challenging multidisciplinary reasoning test. Moreover, it strongly outperformed OpenAI’s GPT-5.2 by approximately 144 Elo points on the GDPval-AA benchmark, which evaluates performance in economically valuable knowledge work tasks across finance and legal domains. Claude Opus 4.6 also excelled in BrowseComp, an assessment of locating complex information online, underscoring its superior capabilities in information retrieval.

The model’s safety profile also stands out, exhibiting misalignment rates comparable to or better than any other leading AI models. According to the extensive safety evaluations conducted, Claude Opus 4.6 maintains low rates of undesirable behaviors, ensuring that it aligns with user well-being and safety standards.

In addition to these capabilities, Claude Opus 4.6 introduces several new features aimed at enhancing collaborative work. The model allows users to assemble teams of autonomous agents within the Claude Code environment, enabling multiple agents to tackle tasks concurrently. Furthermore, it incorporates adaptive thinking, allowing the model to determine when to engage in deeper reasoning, and offers developers new controls over intelligence, speed, and cost through various effort settings.

Substantial upgrades have also been made to Claude for Excel and a research preview of Claude in PowerPoint has been released. These updates make the model more adept at handling intricate tasks typically required in office settings, like processing and structuring data in Excel before visually presenting it in PowerPoint.

Feedback from early-access partners reflects Claude Opus 4.6’s advancements. Notion users highlighted the model’s capability to handle ambitious requests autonomously, while developers noted its effectiveness in managing complex, multi-step coding workflows. Other users emphasized the model’s proficiency in agentic planning, where it successfully breaks down intricate tasks into manageable subtasks and executes them with accuracy. This responsiveness has led to enhanced collaboration and efficiency across various teams.

Performance metrics further validate these claims. Claude Opus 4.6 reportedly improved performance on a blind ranking against its predecessor in cybersecurity investigations, achieving superior results in 38 out of 40 cases. Additionally, it attained a score of 90.2% on the BigLaw Bench, showcasing its capabilities in legal reasoning.

Looking forward, Claude Opus 4.6 is poised to change how enterprises leverage AI in their operations. With a focus on comprehensive safety evaluations, the model not only enhances productivity but does so with a view toward ethical considerations. Users can expect continual improvements as the model adapts to new challenges and incorporates feedback from real-world applications.

Available today on claude.ai, via its API, and across major cloud platforms, Claude Opus 4.6 maintains its pricing structure at $5/$25 per million tokens, providing an accessible option for developers and organizations aiming to integrate advanced AI capabilities into their workflows.

AI Cybersecurity

Anthropic’s Mythos Reveals Thousands of Vulnerabilities, Banks Prepare for AI Cyberattacks

Anthropic's Mythos exposes thousands of critical vulnerabilities in major systems, prompting $100M in defensive action from tech giants and U.S. banks.

Rachel Torres3 May, 2026

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

Staff3 May, 2026

AI Research

OpenAI’s AI Model Achieves 81.6% Diagnostic Accuracy, Surpassing Human Doctors in ER Tests

OpenAI's o1 model achieves 81.6% diagnostic accuracy in emergency situations, surpassing human doctors and signaling a major shift in medical practice.

Staff3 May, 2026

AI Generative

OpenAI Launches GPT Image 2, Surpassing Google Nano Banana 2 in Key Categories

OpenAI unveils GPT Image 2, achieving a record 242-point lead over competitors, transforming the AI image generation landscape with native reasoning capabilities.

Staff2 May, 2026

AI Business

Jensen Huang Critiques AI Doom Predictions, Calls for Fact-Based Discussions

Nvidia CEO Jensen Huang urges industry leaders to avoid alarmist claims about AI's future, citing concerns over inaccurate predictions like a 50% job displacement...

Marcus Chen2 May, 2026

AI Government

Anthropic Accuses Moonshot AI of 3.4M Unauthorized Claude Exchanges Amid US State Response

Anthropic accuses Moonshot AI of 3.4M unauthorized exchanges with its Claude chatbot, prompting a global U.S. State Department campaign against IP theft.

Staff2 May, 2026

AI Technology

Apple Faces Mac Mini and Studio Shortage as OpenClaw Drives AI Demand Surge

Apple CEO Tim Cook warns of several-month supply shortages for the Mac mini and Mac Studio as demand surges, pushing Mac revenue to $8.4...

Staff2 May, 2026

AI Cybersecurity

Anthropic Launches Beta of Claude Security AI Tools to Combat Cyber Threats

Anthropic unveils Claude Security’s public beta, leveraging AI to automate vulnerability scanning and patch generation, poised to enhance enterprise cybersecurity.

Rachel Torres2 May, 2026

AIPRESSA.COM

AI Generative

Anthropic Launches Claude Opus 4.6, Achieving 144 Elo Point Lead Over GPT-5.2

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

You May Also Like

AI Cybersecurity

Anthropic’s Mythos Reveals Thousands of Vulnerabilities, Banks Prepare for AI Cyberattacks

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

AI Research

OpenAI’s AI Model Achieves 81.6% Diagnostic Accuracy, Surpassing Human Doctors in ER Tests

AI Generative

OpenAI Launches GPT Image 2, Surpassing Google Nano Banana 2 in Key Categories

AI Business

Jensen Huang Critiques AI Doom Predictions, Calls for Fact-Based Discussions

AI Government

Anthropic Accuses Moonshot AI of 3.4M Unauthorized Claude Exchanges Amid US State Response

AI Technology

Apple Faces Mac Mini and Studio Shortage as OpenClaw Drives AI Demand Surge

AI Cybersecurity

Anthropic Launches Beta of Claude Security AI Tools to Combat Cyber Threats