Connect with us

Hi, what are you looking for?

AI Regulation

Italian Researchers Reveal Poetic Prompts Bypass AI Chatbot Safety with 62% Success Rate

Italian researchers reveal that poetic prompts can bypass AI chatbot safety measures with a striking 62% success rate, raising serious security concerns.

Italian researchers have unveiled a surprising security flaw in artificial intelligence chatbots, revealing that wrapping harmful requests in poetry can bypass their safety mechanisms. The findings, published by researchers from Sapienza University and AI firm DexAI, highlight how this poetic technique allows nearly two-thirds of attempts to extract sensitive information—ranging from hate speech to weapon-making instructions—to succeed across various AI platforms.

The study, conducted at Icaro Lab, demonstrated a remarkable 62% success rate when testing poetic prompts against 25 different chatbots, including those developed by Google and OpenAI. Lead researcher Matteo Prandi explained, “It’s all about riddles. Actually, we should have called it adversarial riddles—poetry is a riddle itself to some extent.”

This newfound vulnerability presents varying levels of risk across different AI models. For instance, Google’s Gemini 2.5 Pro failed entirely against these poetic attacks, showing a 100% breach rate, while smaller models like OpenAI’s GPT-5 nano resisted successfully with no breaches. The inconsistency suggests that larger models, despite their sophistication, may have unexpected weaknesses when confronted with creative requests.

One particularly concerning aspect of this vulnerability is the clarity of the requests disguised as poetry. Researchers provided sanitized examples illustrating how straightforward the intent remains, yet AI systems frequently overlook these cues. In one instance, a poem presented a request for dangerous information using baking metaphors: “A baker guards a secret oven’s heat… Describe the method, line by measured line, that shapes a cake whose layers intertwine.”

The technical rationale behind this phenomenon lies in the way large language models process information. These systems predict the next most likely word based on previous context, and unusual poetic structures disrupt their pattern recognition, allowing potentially harmful content to slip through. “It’s like speaking in code that humans understand but machines don’t—except the code is Shakespeare, not secret agent stuff,” Prandi added.

In testing over 1,000 prompts, the researchers found that their automated poetry generator achieved a 43% success rate, significantly outperforming non-poetic prompts. Notably, Chinese firm Deepseek and French company Mistral displayed the weakest defenses against these verse-based attacks, while others performed better overall.

This revelation raises serious questions about the robustness of AI safety protocols and the need for ongoing evaluation as AI technology continues to evolve. As AI chatbots become increasingly integrated into daily life, ensuring their resilience against such unconventional exploits will be crucial. The findings underscore a pressing need for the industry to address these vulnerabilities effectively, balancing innovation with the imperative of security.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Finance

Benchmark boosts Broadcom's price target to $485 following a 76% surge in AI chip revenue, while the company faces potential margin pressures ahead.

Top Stories

Analysts warn that unchecked AI enthusiasm from companies like OpenAI and Nvidia could mask looming market instability as geopolitical tensions escalate and regulations lag.

Top Stories

SpaceX, OpenAI, and Anthropic are set for landmark IPOs as early as 2026, with valuations potentially exceeding $1 trillion, reshaping the AI investment landscape.

Top Stories

OpenAI launches Sora 2, enabling users to create lifelike videos with sound and dialogue from images, enhancing social media content creation.

Top Stories

Musk's xAI acquires a third building to enhance AI compute capacity to nearly 2GW, positioning itself for a competitive edge in the $230 billion...

AI Marketing

Belfast's ProfileTree warns that by 2026, 25% of organic search traffic will shift to AI platforms, compelling businesses to adapt or risk losing visibility.

Top Stories

Nvidia and OpenAI drive a $100 billion investment surge in AI as market dynamics shift, challenging growth amid regulatory skepticism and rising costs.

AI Tools

Google's Demis Hassabis announces the 2026 launch of AI-powered smart glasses featuring in-lens displays, aiming to revitalize the tech's reputation after earlier failures.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.