Why Chinese AI Now Costs 30 Times Less Than American Models

Markets 2026-05-28 16:30

Why Chinese AI Now Costs 30 Times Less Than American Models

Two of China's most capable AI labs cut their model prices to a fraction of Western rivals this week, while OpenAI and Anthropic moved the other way.

Key Points:

  • DeepSeek made its 75% V4-Pro discount permanent on May 22, fixing output at $0.87 per million tokens.
  • Xiaomi cut MiMo-V2.5 prices by up to 99% on May 26, with cached Pro inputs as low as $0.0036 per million tokens.
  • OpenAI lifted GPT-5.5 output to $30 per million tokens, widening the gap with Chinese frontier models.

DeepSeek, Xiaomi Cut Rates

DeepSeek confirmed on May 22 that a temporary 75% discount on its V4-Pro model would become permanent, fixing output at $0.87 per million tokens and input at $0.435.

The promotion had been set to expire May 31.

Days later, fellow Chinese lab Xiaomi slashed MiMo-V2.5 rates by up to 99% for cached inputs, effective May 27, with the Pro tier's cache hits priced as low as $0.0036 per million tokens.

By contrast, GPT-5.5 from OpenAI doubled its predecessor's output rate to $30 per million tokens. Claude Opus 4.7 from Anthropic lists $5 input and $25 output.

Also Read: Anthropic Moves Restricted Claude Mythos Model Closer To Public Release

Engineers Defend the Math

Xiaomi also rebuilt its token plans. The $100 Max plan now grants 82 billion tokens, up from 1.6 billion, with the same money buying five to eight times more usage than before.

Fuli Luo, who leads Xiaomi's MiMo team and once co-built DeepSeek-V2, tied the cuts to a smarter way of storing and reusing data the model has already processed.

That approach trims computing demand sharply.

Luo argued the lab can run near full capacity at the new rates and still cover its costs, which suggests the pricing reflects real efficiency gains rather than a loss-leading promotion.

The savings matter most for production tasks that reuse the same context. Agent pipelines with stable prompts, document processors, and retrieval tools all hit cache constantly, so cheaper cached input cuts the running bill directly.

Western labs face a different bind. OpenAI's pivot toward consumer features and advertising hints that token revenue alone may not carry its valuation.

Why the Gap Keeps Widening

DeepSeek and Xiaomi did not open this contest. Chinese models already undercut American rivals before either announcement landed.

MiniMax M2.7 runs at $0.30 input and $1.20 output per million tokens. Kimi K2.5 from Moonshot AI sits at $0.60 and $2.50.

Analysts tracking cost against benchmark performance peg the Q2 2026 price-to-quality gap between Chinese and American frontier models at roughly 15 to 30 times, before any cache discounts. This week's reductions narrow that gap further for the repetitive workloads that dominate real deployments.

The pattern echoes early 2025, when DeepSeek's low-cost releases rattled markets and forced Western providers to defend their pricing. A year on, the pressure has only intensified, and the response from American labs has been to hold or raise rates rather than chase the floor.

Read Next: Ethereum Network Empties Out As Staking Locks A Record 32% Of Supply

Share to:

This content is for informational purposes only and does not constitute investment advice.

Curated Series

SuperEx Popular Science Articles Column

SuperEx Popular Science Articles Column

This collection features informative articles about SuperEx, aiming to simplify complex cryptocurrency concepts for a wider audience. It covers the basics of trading, blockchain technology, and the features of the SuperEx platform. Through easy-to-understand content, it helps users navigate the world of digital assets with confidence and clarity.

Unstaked related news and market dynamics research

Unstaked related news and market dynamics research

Unstaked (UNSD) is a blockchain platform integrating AI agents for automated community engagement and social media interactions. Its native token supports governance, staking, and ecosystem features. This special feature explores Unstaked’s market updates, token dynamics, and platform development.

XRP News and Research

XRP News and Research

This series focuses on XRP, covering the latest news, market dynamics, and in-depth research. Featured analysis includes price trends, regulatory developments, and ecosystem growth, providing a clear overview of XRP's position and potential in the cryptocurrency market.

How do beginners trade options?How does option trading work?

How do beginners trade options?How does option trading work?

This special feature introduces the fundamentals of options trading for beginners, explaining how options work, their main types, and the mechanics behind trading them. It also explores key strategies, potential risks, and practical tips, helping readers build a clear foundation to approach the options market with confidence.

What are the risks of investing in cryptocurrency?

What are the risks of investing in cryptocurrency?

This special feature covers the risks of investing in cryptocurrency, explaining common challenges such as market volatility, security vulnerabilities, regulatory uncertainties, and potential scams. It also provides analysis of risk management strategies and mitigation techniques, helping readers gain a clear understanding of how to navigate the crypto market safely.