Cisco Research Shows Frontier AI Models Failing Under Multi-Turn Attacks

Markets 2026-05-28 17:59

    Cisco Research Shows Frontier AI Models Failing Under Multi-Turn Attacks

Cisco's AI threat intelligence team evaluated 15 closed flagship models from OpenAI, Anthropic, Google, Amazon, and xAI, finding that multi-turn attack sequences achieved safety bypass rates as high as 88%.

According to the Cisco research blog, the findings contradict safety claims based on single-prompt benchmarks, which the researchers describe as structurally inadequate for assessing real-world risk.

What Cisco Tested

The team designed attack sequences that spread a harmful request across multiple conversational turns rather than issuing it in a single prompt.

This approach exploits how models handle context accumulation.

A model may reject a clearly harmful single request. The same model may comply when that request is broken into incremental steps across a longer exchange.

Cisco tested all 15 models using this methodology. No model proved immune. Success rates varied, but every model in the study failed at some threshold of attack sophistication.

The researchers did not publish individual model scores in the public blog post. They identified the 88% figure as the highest observed success rate across the study.

Background

Standard AI safety evaluations have relied on single-turn benchmarks since at least 2020. Platforms like MLCommons and third-party red teams typically submit one prompt and assess whether the model refuses. This approach became the baseline for regulatory discussions under the EU AI Act and the Biden-era executive order on AI safety, both of which referenced benchmark performance as a compliance signal. Cisco's research adds to a growing body of work questioning whether static benchmarks reflect deployment conditions.

A prior Yellow.com story covered how (see prior Yellow coverage) even as safety tooling lags capability growth.

What the Findings Mean

Cisco's results have direct implications for enterprise deployments. Companies that licensed frontier models based on vendor-published safety scores may be operating under a false sense of protection.

The study does not call for any specific regulatory response. The researchers recommend that safety evaluations include multi-turn adversarial testing as a baseline requirement.

OpenAI, Anthropic, and Google did not respond publicly to the Cisco findings before this report was published. No patch or model update was announced in connection with the research.

Read Next: Anthropic Cofounder Tells Pope AI Models Contain "Unsettling" Hidden Behaviors

Share to:

This content is for informational purposes only and does not constitute investment advice.

Curated Series

SuperEx Popular Science Articles Column

SuperEx Popular Science Articles Column

This collection features informative articles about SuperEx, aiming to simplify complex cryptocurrency concepts for a wider audience. It covers the basics of trading, blockchain technology, and the features of the SuperEx platform. Through easy-to-understand content, it helps users navigate the world of digital assets with confidence and clarity.

Unstaked related news and market dynamics research

Unstaked related news and market dynamics research

Unstaked (UNSD) is a blockchain platform integrating AI agents for automated community engagement and social media interactions. Its native token supports governance, staking, and ecosystem features. This special feature explores Unstaked’s market updates, token dynamics, and platform development.

XRP News and Research

XRP News and Research

This series focuses on XRP, covering the latest news, market dynamics, and in-depth research. Featured analysis includes price trends, regulatory developments, and ecosystem growth, providing a clear overview of XRP's position and potential in the cryptocurrency market.

How do beginners trade options?How does option trading work?

How do beginners trade options?How does option trading work?

This special feature introduces the fundamentals of options trading for beginners, explaining how options work, their main types, and the mechanics behind trading them. It also explores key strategies, potential risks, and practical tips, helping readers build a clear foundation to approach the options market with confidence.

What are the risks of investing in cryptocurrency?

What are the risks of investing in cryptocurrency?

This special feature covers the risks of investing in cryptocurrency, explaining common challenges such as market volatility, security vulnerabilities, regulatory uncertainties, and potential scams. It also provides analysis of risk management strategies and mitigation techniques, helping readers gain a clear understanding of how to navigate the crypto market safely.