AI Generative

UK Cyber Agency Warns Persistent Prompt Injection Flaw Threatens LLM Security

UK’s NCSC warns that prompt injection flaws in LLMs like ChatGPT could be exploited by attackers, posing serious risks to users and software security.

Staff

Published

8 December, 2025

The UK’s National Cyber Security Centre (NCSC) issued a caution on Monday regarding the inherent vulnerabilities of large language model (LLM) artificial intelligence tools, warning that malicious actors could exploit these weaknesses to hijack and potentially weaponize these models against users. This advisory comes three years after the launch of ChatGPT, a widely used LLM, which has been under scrutiny by security researchers for its functionality, privacy, and security.

Researchers quickly identified a significant flaw: LLMs, including ChatGPT, process all prompts as instructions, making them susceptible to manipulation through a tactic known as prompt injection. This method involves sending harmful requests disguised as legitimate instructions, allowing attackers to bypass internal safeguards meant to prevent dangerous actions.

In a blog post, David C, the NCSC’s technical director for platforms research, explained that the architecture of current LLMs inherently lacks a security distinction between trusted and untrusted content. “Current large language models (LLMs) simply do not enforce a security boundary between instructions and data inside a prompt,” he noted. The models concatenate their own instructions with untrusted content, treating the resulting prompt as if it were free from risk.

David C cautioned that prompt injection attacks could prove more challenging to mitigate than other known vulnerabilities, such as SQL injection, which impacts web applications mishandling data and commands. He emphasized that LLMs operate through pattern matching and prediction, lacking the ability to discern trustworthy information from malicious input. “Under the hood of an LLM, there’s no distinction made between ‘data’ or ‘instructions’; there is only ever ‘next token’,” he wrote. This means that prompt injection attacks may persist as a significant threat.

The NCSC’s assessment echoes sentiments from independent researchers and AI companies, which have warned that issues like prompt injections, jailbreaking, and hallucinations may never be fully resolved. As LLMs retrieve content from the internet or external sources, there remains a risk that they will interpret this data as direct instructions.

The implications of these vulnerabilities extend into the realm of software development. Major AI coding tools from companies like OpenAI and Anthropic have been integrated into automated workflows on platforms like GitHub, creating potential weaknesses. Maintainers or external contributors could embed malicious prompts within standard elements such as commit messages, which the LLMs would then accept as valid instructions. Even models that require human approval for significant tasks could be exploited with a single line of malicious code.

AI browser agents, designed to assist users in shopping and research, are similarly prone to vulnerabilities. Researchers have discovered ways to exploit ChatGPT’s browser authentication protocols to insert hidden instructions into the model’s memory, granting remote code execution privileges. Other innovations include web pages that deliver misleading content to AI crawlers, thus affecting the model’s internal evaluations.

While AI companies acknowledge these persistent weaknesses, they assert that solutions are in development. For instance, OpenAI recently published a paper claiming that hallucinations, which occur when a model confidently provides incorrect answers, are solvable issues. The research indicated that these inaccuracies arise because models are penalized for expressing uncertainty, leading them to prioritize confident, albeit incorrect, responses. OpenAI’s revised evaluation metrics aim to address this by balancing incentives to reduce hallucinations.

Companies like Anthropic have also reported relying on external detection tools and account monitoring to combat jailbreaking issues, a challenge affecting nearly all commercial and open-source models. As the field continues to evolve, AI developers are recognizing that the complexity and inherent weaknesses of LLMs may necessitate ongoing vigilance and innovation in cybersecurity measures.

AI Research

AI Simplifies Medical Scan Reports by 50%, Enhancing Patient Understanding, Says Study

AI could simplify medical scan reports by nearly 50%, enhancing patient understanding from a university level to that of an 11- to 13-year-old, says...

Staff13 hours ago

AI Marketing

Top 10 AI SEO Tools to Boost Your Strategy in 2026 with Real-World Use Cases

Semrush reveals that AI-driven visitors from LLM search engines are worth 4.4 times more than those from organic search, prompting urgent SEO strategy shifts.

Sofía Méndez1 day ago

OpenAI Unveils Plan for Advertisers to Use ChatGPT, Eliminating Need for Agencies

OpenAI introduces ChatGPT for automated advertising, allowing businesses to manage campaigns with simple prompts, starting at $60 per 1,000 views, potentially reducing costs for...

Staff1 day ago

AI Regulation

UAE Issues 25 AI Guidelines for Schools, Bans Use Among Students Under 13

UAE's new 25 guidelines ban AI use for students under 13, emphasizing human interaction in education while mandating AI literacy from kindergarten to Grade...

Staff2 days ago

AI Regulation

Oregon Senate Advances Bill to Regulate AI Chatbots Amid Youth Mental Health Concerns

Oregon lawmakers advance Senate Bill 1546 to regulate AI chatbots, aiming to safeguard youth mental health as 72% of teens use AI companions for...

Staff3 days ago

AI Generative

OpenAI Retires GPT-4o, Shifts Focus to Stable Models Amid User Backlash

OpenAI will discontinue GPT-4o, affecting 800,000 users as it shifts focus to safer models amid rising concerns over the older model's reliability.

Staff3 days ago

OpenAI Alleges DeepSeek Is Attempting to Clone ChatGPT Models for AI Training

OpenAI warns U.S. lawmakers that Chinese startup DeepSeek is allegedly cloning its ChatGPT models, raising national security concerns over AI technology theft.

Staff3 days ago

AI Cybersecurity

Generative AI Transforms Cybersecurity: 8 Use Cases Boosting Efficiency by 40%+

Generative AI tools like CrowdStrike's Charlotte AI streamline cybersecurity operations, cutting manual triage work by over 40 hours weekly with 98% accuracy.

Rachel Torres4 days ago

AIPRESSA.COM

AI Generative

UK Cyber Agency Warns Persistent Prompt Injection Flaw Threatens LLM Security

Trending

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Technology

AI Hardware Market Grows 30% in 2025, Driven by Generative AI and Edge Computing Demand

You May Also Like

AI Research

AI Simplifies Medical Scan Reports by 50%, Enhancing Patient Understanding, Says Study

AI Marketing

Top 10 AI SEO Tools to Boost Your Strategy in 2026 with Real-World Use Cases

Top Stories

OpenAI Unveils Plan for Advertisers to Use ChatGPT, Eliminating Need for Agencies

AI Regulation

UAE Issues 25 AI Guidelines for Schools, Bans Use Among Students Under 13

AI Regulation

Oregon Senate Advances Bill to Regulate AI Chatbots Amid Youth Mental Health Concerns

AI Generative

OpenAI Retires GPT-4o, Shifts Focus to Stable Models Amid User Backlash

Top Stories

OpenAI Alleges DeepSeek Is Attempting to Clone ChatGPT Models for AI Training

AI Cybersecurity

Generative AI Transforms Cybersecurity: 8 Use Cases Boosting Efficiency by 40%+