Claude Mythos Solves 32-Step AISI Hack In 6 Of 10 Attempts

Markets 2026-05-16 00:01

Claude Mythos Solves 32-Step AISI Hack In 6 Of 10 Attempts

A new checkpoint of Anthropic's Claude Mythos Preview has become the first AI model to solve both UK government cyberattack simulations, raising fresh questions about autonomous hacking.

AISI Reports Mythos Breakthrough

The UK's AI Security Institute reported Wednesday that the newer Mythos checkpoint completed its 32-step corporate network attack range, "The Last Ones," in 6 of 10 attempts. The earlier version had managed just 3 of 10.

The updated model also cracked "Cooling Tower," an industrial control system range that no prior model had passed, in 3 of 10 tries.

Rival OpenAI's GPT-5.5 was tested on the same exercise. It solved "The Last Ones" in 3 of 10 attempts but did not complete "Cooling Tower."

AISI ran the ranges with a 100 million-token compute budget per attempt, and the agency noted that performance kept scaling at that ceiling, suggesting higher budgets would push success rates further.

Also Read: Southeast Asia Blockchain Week Brings Ripple, Avalanche, Solana Foundation, And K-Pop To Bangkok

Doubling Time Keeps Shrinking

AISI tracks cyber progress through time horizon benchmarks, measuring how long an autonomous task a model can finish at 80% reliability. In November 2025, the agency estimated a doubling time of 8 months. By February 2026, that figure had compressed to 4.7 months, and both Mythos and GPT-5.5 have since exceeded the faster trend.

The agency acknowledged uncertainty about whether the latest results signal a new acceleration or a one-time leap.

Research nonprofit METR, which tracks AI on software tasks rather than cyber ranges, has produced a similar figure of roughly 4.2 months. AISI said the convergence strengthens the case that the trend reflects real capability gains rather than a quirk of one evaluation suite.

The institute stressed that its ranges lack active defenders, so the results show what models can do against weakly protected networks rather than hardened enterprise systems.

Why Capability Jumps Matter

The newer Mythos checkpoint did not arrive with a fresh model release. AISI used the same version Anthropic deployed last month with Project Glasswing, its security partnership program, after receiving an updated build of the same model.

"Notable capability jumps do not always require new model releases," the institute wrote. That cuts against the assumption that defenders can pace themselves to launch cycles.

Anthropic introduced Mythos Preview on Apr. 7, framing the model as a turning point for the security industry after it identified zero-day flaws across major operating systems and browsers in internal tests. The company said it had withheld broader release because of those capabilities, and AISI's earlier April evaluation flagged Mythos as a clear step up from previous frontier systems.

Read Next: Gemini Space Station Hit By Multiple Securities Fraud Claims After IPO

Share to:

This content is for informational purposes only and does not constitute investment advice.

Curated Series

SuperEx Popular Science Articles Column

SuperEx Popular Science Articles Column

This collection features informative articles about SuperEx, aiming to simplify complex cryptocurrency concepts for a wider audience. It covers the basics of trading, blockchain technology, and the features of the SuperEx platform. Through easy-to-understand content, it helps users navigate the world of digital assets with confidence and clarity.

Unstaked related news and market dynamics research

Unstaked related news and market dynamics research

Unstaked (UNSD) is a blockchain platform integrating AI agents for automated community engagement and social media interactions. Its native token supports governance, staking, and ecosystem features. This special feature explores Unstaked’s market updates, token dynamics, and platform development.

XRP News and Research

XRP News and Research

This series focuses on XRP, covering the latest news, market dynamics, and in-depth research. Featured analysis includes price trends, regulatory developments, and ecosystem growth, providing a clear overview of XRP's position and potential in the cryptocurrency market.

How do beginners trade options?How does option trading work?

How do beginners trade options?How does option trading work?

This special feature introduces the fundamentals of options trading for beginners, explaining how options work, their main types, and the mechanics behind trading them. It also explores key strategies, potential risks, and practical tips, helping readers build a clear foundation to approach the options market with confidence.

What are the risks of investing in cryptocurrency?

What are the risks of investing in cryptocurrency?

This special feature covers the risks of investing in cryptocurrency, explaining common challenges such as market volatility, security vulnerabilities, regulatory uncertainties, and potential scams. It also provides analysis of risk management strategies and mitigation techniques, helping readers gain a clear understanding of how to navigate the crypto market safely.