AI Still Relies on Human Oversight for Multi-Stage Cyberattacks, Report Reveals

AI still depends on human oversight to carry out multi-stage cyberattacks, while the finding that 15% of employees access generative AI systems on corporate devices points to urgent security risks.

The International AI Safety Report 2026 finds that while general-purpose AI is increasingly automating individual stages of cyberattacks, fully “autonomous attacks remain limited,” primarily because AI systems cannot consistently manage complex, multi-stage attack sequences without human oversight. The failure modes the report observes include executing irrelevant commands, losing track of operational state, and failing to recover from simple errors without human help.

The report distinguishes between AI systems that can “assist” in the cyberattack chain—such as identifying targets, exploiting vulnerabilities, and generating malicious code—and those that can autonomously execute an entire operation. It further states that “fully autonomous, end-to-end attacks have not been reported,” emphasizing the current limitations of AI in cybersecurity.

Chris Anley, chief scientist at NCC Group, noted in a statement shared with TechInformed that the absence of fully autonomous cyberattacks does not eliminate the risk. He explained that attackers are already using AI to identify vulnerabilities and create exploits, which lets them mount more attacks with less technical expertise. Anley characterized AI-enabled attacks as the “new normal” and urged organizations to invest in faster detection, robust controls, and defensive AI to match the scale and speed of modern cyber threats.

Data from DARPA’s AI Cyber Challenge (AIxCC) provides a benchmark for performance on discrete security tasks. In the controlled environment of the August 2025 Final Competition, DARPA reported that competing AI systems discovered 54 unique synthetic vulnerabilities out of 63 challenges and successfully patched 43 of them. These final figures revise earlier preliminary numbers for patch success, while the total number of discovered vulnerabilities remained unchanged. As detailed by the Congressional Research Service, the AIxCC aims to move AI systems toward “machine speed” identification and patching of vulnerabilities to bolster critical infrastructure against both human-led and AI-assisted threats.
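To put the AIxCC figures in proportion, the short Python sketch below converts the three counts quoted above into percentages. The counts come from the article; the constant names and the `rate` helper are illustrative assumptions. Because the wording leaves open whether the 43 patches should be measured against all 63 challenges or only against the 54 discovered vulnerabilities, the sketch computes both.

```python
# Counts quoted in the article for the AIxCC Final Competition (August 2025).
TOTAL_CHALLENGES = 63   # synthetic vulnerabilities in the challenge set
DISCOVERED = 54         # unique synthetic vulnerabilities found
PATCHED = 43            # vulnerabilities successfully patched


def rate(part: int, whole: int) -> float:
    """Return part/whole as a percentage, rounded to one decimal place."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return round(100 * part / whole, 1)


# Share of all challenges in which a vulnerability was discovered.
discovery_rate = rate(DISCOVERED, TOTAL_CHALLENGES)

# Two readings of the patch figure (the denominator is an assumption either way).
patch_rate_of_all = rate(PATCHED, TOTAL_CHALLENGES)
patch_rate_of_found = rate(PATCHED, DISCOVERED)

print(f"Discovered: {discovery_rate}% of challenges")
print(f"Patched:    {patch_rate_of_all}% of challenges")
print(f"Patched:    {patch_rate_of_found}% of discovered vulnerabilities")
```

On these numbers the systems found roughly six in seven of the planted vulnerabilities and patched about two thirds of the full set, which is strong performance on isolated tasks but says nothing about chaining those tasks into an end-to-end operation.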

The World Economic Forum’s Global Cybersecurity Outlook 2026 situates AI within a broader risk landscape that includes geopolitical fragmentation and uneven cyber capabilities. The report underscores AI’s dual role, enhancing defensive measures while simultaneously empowering more sophisticated attacks.

Incident datasets also point to growing AI exposure in cyber threats. Verizon’s 2025 Data Breach Investigations Report Executive Summary found “evidence” of generative AI usage by threat actors, as reported by the AI platforms themselves. It stated that the prevalence of synthetically generated text in malicious emails has doubled over the past two years. The report also found that 15% of employees accessed generative AI systems on corporate devices, with most of them signing in through non-corporate email accounts (72%) or corporate email accounts lacking integrated authentication (17%). Mandiant’s M-Trends 2025 further reported that exploits (33%), stolen credentials (16%), and phishing (14%) were the leading initial infection vectors in its 2024 investigations.
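The Verizon percentages are easier to interpret as shares of the whole workforce. The sketch below does that arithmetic in Python, under the assumption that the 72% and 17% figures are shares of the 15% of employees who accessed generative AI on corporate devices rather than shares of all employees. The variable and function names are illustrative.

```python
# Percentages quoted in the article from Verizon's 2025 DBIR Executive Summary.
GENAI_USERS_PCT = 15          # employees accessing generative AI on corporate devices
NON_CORPORATE_EMAIL_PCT = 72  # of those users: signed in with non-corporate email
NO_INTEGRATED_AUTH_PCT = 17   # of those users: corporate email, no integrated auth


def share_of_workforce(subset_pct: int, within_subset_pct: int) -> float:
    """Convert a percentage of a subset into a percentage of the whole workforce."""
    return subset_pct * within_subset_pct / 100


# Assumption: both account categories are measured within the 15% subset.
non_corporate = share_of_workforce(GENAI_USERS_PCT, NON_CORPORATE_EMAIL_PCT)
no_integrated_auth = share_of_workforce(GENAI_USERS_PCT, NO_INTEGRATED_AUTH_PCT)
weakly_governed = share_of_workforce(
    GENAI_USERS_PCT, NON_CORPORATE_EMAIL_PCT + NO_INTEGRATED_AUTH_PCT
)

print(f"Non-corporate email:          {non_corporate}% of all employees")
print(f"No integrated authentication: {no_integrated_auth}% of all employees")
print(f"Either category:              {weakly_governed}% of all employees")
```

Read this way, roughly one employee in eight would be reaching generative AI tools from a corporate device through an account the organization does not centrally control.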

Research on agent reliability also points to the brittleness of AI in long-horizon tasks. “The Agent’s Marathon,” published on OpenReview, finds that large language model (LLM) agents “remain brittle” over extended tasks, with performance deteriorating rapidly. Meanwhile, the authors of the Agent Security Bench (ASB) warn that LLM-based agents could introduce “critical security vulnerabilities” and propose ASB as a framework for benchmarking attacks and defenses across various scenarios, tools, and methods. A 2025 survey paper on ScienceDirect treats “LLM-based agents” as a distinct area for both attacks and defenses and suggests criteria for evaluating their effectiveness.

The International AI Safety Report 2026 also notes a shift toward “Frontier AI Safety Frameworks” as a key method for managing AI risks. These frameworks increasingly depend on “if-then” safety commitments, which define specific capability thresholds that, once a model reaches them, trigger mandatory safety mitigations. This approach aims to address the “evidence dilemma,” the challenge of formulating policy when the pace of AI advancement outstrips scientific understanding of the associated risks. The report indicates that the number of companies publishing voluntary safety frameworks has doubled since last year, though it cautions that “real-world evidence of their effectiveness remains limited.” To strengthen safeguards further, the report advocates a “defense-in-depth” strategy, which combines technical safeguards, system-level monitoring, and organizational risk processes so that the failure of any single control does not lead to a systemic breach.
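The logic of an “if-then” commitment can be sketched in a few lines of Python. This is a minimal illustration only: the capability names, threshold values, and mitigation labels are hypothetical and are not taken from the report or from any published framework.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Commitment:
    capability: str               # hypothetical capability being evaluated
    threshold: float              # hypothetical score at which the "if" is met
    mitigations: tuple[str, ...]  # the "then": safeguards that become mandatory


# Hypothetical commitments, invented for illustration.
COMMITMENTS = (
    Commitment("cyber_offense", 0.6, ("restrict_api_access", "enhanced_monitoring")),
    Commitment("autonomous_operation", 0.3, ("pause_deployment",)),
)


def required_mitigations(eval_scores: dict[str, float]) -> list[str]:
    """Return every mitigation triggered by the given evaluation scores.

    A capability with no score is treated as having reached its threshold.
    That is a conservative choice made for this sketch, not a rule from the report.
    """
    required: list[str] = []
    for commitment in COMMITMENTS:
        score = eval_scores.get(commitment.capability)
        if score is None or score >= commitment.threshold:
            required.extend(commitment.mitigations)
    return required


# One capability crosses its threshold, the other stays below it.
print(required_mitigations({"cyber_offense": 0.7, "autonomous_operation": 0.1}))
```

The point of the structure is that the mitigations are fixed before the evaluation is run, so a developer commits to a response without first needing settled evidence about how dangerous a given capability level is.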

Written by Rachel Torres

At AIPressa, my work focuses on exploring the paradox of AI in cybersecurity: it's both our best defense and our greatest threat. I've closely followed how AI systems detect vulnerabilities in milliseconds while attackers simultaneously use them to create increasingly sophisticated malware. My approach: explaining technical complexities in an accessible way without losing the urgency of the topic. When I'm not researching the latest AI-driven threats, I'm probably testing security tools or reading about the next attack vector keeping CISOs awake at night.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.