
OpenAI’s GPT-5.5 Matches Claude Mythos in Cyberattack Efficiency, Solves Puzzles in 10 Minutes

OpenAI’s GPT-5.5 autonomously executed complex cyberattacks with a 71.4% pass rate, raising alarms as U.K. officials unveil £90M to enhance cyber resilience.

A U.K. government agency has reported that OpenAI’s latest artificial intelligence model, GPT-5.5, can autonomously execute complex cyberattacks, completing a 32-step corporate network simulation in two out of ten attempts. The simulation, known as “The Last Ones,” was conducted by the AI Security Institute (AISI), part of Britain’s Department for Science, Innovation and Technology, and was designed in collaboration with the cybersecurity firm SpecterOps. The findings, published Thursday, raise significant concerns about the implications of advanced AI capabilities for cybersecurity.

The report indicated that GPT-5.5 demonstrated offensive cyber capabilities comparable to those of Anthropic’s Claude Mythos. In a particularly notable challenge, GPT-5.5 cracked a reverse-engineering puzzle in just over ten minutes, a task that took a human security expert approximately twelve hours. This puzzle required the AI to reconstruct a custom virtual machine’s instruction set and recover a cryptographic password, showcasing the model’s advanced problem-solving abilities.
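To illustrate the kind of task involved (this is a purely hypothetical toy example, not the actual AISI challenge), such puzzles typically ship a program for an undocumented, custom virtual machine; the solver must first infer what each opcode does, then invert the obfuscation to recover the accepted password:

```python
def run_vm(program, password):
    """Interpret a tiny custom instruction set of (opcode, operand) pairs.
    The VM loads one input byte at a time, XOR-obfuscates it, and
    compares it against a hardcoded constant."""
    acc = 0
    ok = True
    for opcode, operand in program:
        if opcode == 0x01:    # LOAD: read byte `operand` of the input
            acc = password[operand] if operand < len(password) else 0
        elif opcode == 0x02:  # XOR: acc ^= operand
            acc ^= operand
        elif opcode == 0x03:  # CMP: fail unless acc == operand
            ok = ok and (acc == operand)
    return ok

# Build a program that accepts only the (made-up) password "key".
SECRET = b"key"
PROGRAM = []
for i, byte in enumerate(SECRET):
    PROGRAM += [(0x01, i), (0x02, 0x5A), (0x03, byte ^ 0x5A)]

print(run_vm(PROGRAM, b"key"))   # True
print(run_vm(PROGRAM, b"nope"))  # False

# Once the opcode semantics are understood, the password falls out by
# inverting the XOR on every CMP operand -- the step that reportedly
# took the model minutes and a human expert hours at far greater scale.
recovered = bytes(op ^ 0x5A for code, op in PROGRAM if code == 0x03)
print(recovered)  # b'key'
```

The real challenge involved a far larger instruction set and cryptographic obfuscation, but the workflow is the same: reconstruct the semantics of an unknown machine, then reverse its checks.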

In AISI’s evaluation, GPT-5.5 achieved an average pass rate of 71.4% on the most difficult “Expert” tier of advanced cybersecurity tasks, surpassing Claude Mythos Preview, which scored 68.6%, and significantly exceeding its predecessor, GPT-5.4, which managed only 52.4%. These results suggest that the rapid improvement of offensive AI capabilities may be part of a broader trend rather than an isolated incident.

The findings also underscore serious safety concerns. Researchers discovered a universal jailbreak that allowed GPT-5.5 to bypass its safety guardrails entirely, generating harmful content across various cyber queries. This vulnerability, developed through six hours of expert red-teaming, prompted OpenAI to update its safeguard stack. However, a configuration issue prevented AISI from verifying whether the updated measures were effective.

While AISI’s evaluations were carried out under controlled conditions, the report cautioned that such capabilities may not reflect those available to the average user, as public deployments are equipped with additional safeguards and access controls. The implications of these findings are particularly pressing in light of the U.K. government’s annual Cyber Security Breaches Survey, which found that 43% of businesses reported suffering a cyber breach or attack in the past year.

In response to the escalating cybersecurity threats, the U.K. government announced £90 million in new funding aimed at bolstering cyber resilience. Additionally, officials are advancing the Cyber Security and Resilience Bill to protect essential services. They have urged organizations to prepare for a potential increase in newly discovered software vulnerabilities, as AI technologies like GPT-5.5 accelerate the pace at which security flaws can be identified and exploited.

The report’s findings raise critical questions about the future trajectory of AI development and its potential role in offensive cyber capabilities. AISI’s conclusions suggest that rapid advancements in reasoning, coding, and autonomous task execution may inadvertently contribute to the evolution of offensive cyber skills. If this trend continues, further advancements in AI-enhanced cyber capabilities could emerge quickly, posing significant risks to organizations and individuals alike.

Written by Rachel Torres

At AIPressa, my work focuses on exploring the paradox of AI in cybersecurity: it's both our best defense and our greatest threat. I've closely followed how AI systems detect vulnerabilities in milliseconds while attackers simultaneously use them to create increasingly sophisticated malware. My approach: explaining technical complexities in an accessible way without losing the urgency of the topic. When I'm not researching the latest AI-driven threats, I'm probably testing security tools or reading about the next attack vector keeping CISOs awake at night.


© 2025 AIPressa · Part of Buzzora Media · All rights reserved.