Get the latest crypto news, updates, and reports by subscribing to our free newsletter.
Giấy phép số 4978/GP-TTĐT do Sở Thông tin và Truyền thông Hà Nội cấp ngày 14 tháng 10 năm 2019 / Giấy phép SĐ, BS GP ICP số 2107/GP-TTĐT do Sở TTTT Hà Nội cấp ngày 13/7/2022.
© 2026 Index.vn
Google and Amazon Web Services (AWS) led price increases for AI cloud products in January 2026, marking a reversal after a period in which cloud AI compute had generally become cheaper. The shift has since spread across public cloud markets, with Chinese providers also breaking the long-standing pattern of “only down, never up” pricing for AI services.
On March 18, Alibaba Cloud announced higher prices for core AI compute and storage. Baidu AI Cloud followed with its own price increases, while Tencent Cloud began ending free public trials for certain large language models (LLMs) and raised API call prices.
The price movements reflect the broader spread of AI compute inflation into the public cloud market. In macro terms, inflation is a price adjustment when supply does not meet demand. In this case, high-end GPUs and related resources had previously been scarce and expensive upstream, but cloud providers attracted developers with low-cost tokens and API pricing.
That token-based pricing model did not fully reflect the underlying supply-demand dynamics for compute. The result was a period in which providers effectively absorbed more of the cost pressure, while customers benefited from comparatively low token and API prices. As demand intensified, the cost burden increasingly shifted to users.
Many developers report that service models (MaaS) can limit API throughput or throttle real-time streams, sometimes requiring prepayment to restore normal service levels. After funding, API call costs can rise sharply, creating ongoing cost pressure for developers.
In supply terms, 2025 saw elevated costs for chips and high-performance memory, with suppliers facing price pressure. On the demand side, AI agents expanded rapidly, and token consumption per task reportedly rose by hundreds of times compared with traditional AI chat. Multimodal applications—such as video creation, virtual reality, and real-time voice—also became more widespread in 2025, further increasing token demand.
The pricing changes are described as a batch rebalancing across cloud providers, reflecting hardware costs and resource scarcity rather than a purely cost-push explanation. One example cited is the cost of comic book production around Chinese New Year: approximately 200 RMB previously, rising to about 300 RMB.
Pricing strategies are not uniform across all cloud offerings. Basic services such as standard virtual servers (ECS), OSS storage, and VPC networks continue to drop in price. At the same time, some Chinese providers have raised prices for instances that use domestically produced chips, indicating that increases can also reflect tiered pricing for different customer segments rather than external cost pressures alone.
The cost structure change signals the end of an earlier “labor of love” era for AI services, where low token and API pricing helped sustain experimentation. The broader framing is that tokens have become as essential to AI operations as utilities such as electricity, water, and gas.
Governments and enterprises are also increasingly building internal solutions to reduce reliance on public cloud and mitigate token-cost risks. The emergence of DeepSeek all-in-one machines is cited as an example of efforts to reduce cloud API calls and control costs internally.
No one wants a rigid, prolonged rise in AI compute costs. The article argues that transparency and efficiency measures—such as caching, summarization, or the use of local small models—could help reduce cost pressure and support more sustainable AI workflows.
In the medium to long term, the focus is described as achieving cost stability across cycles. Providers that operate a closed loop of “Chip – Model – Cloud” may be better positioned to lower total costs and gain more pricing autonomy, potentially supporting price stability and stronger margins.
The article also notes that the AI inflation phenomenon is global rather than limited to China. Cloud providers began price increases in late 2025, with China’s moves following global trends. For Chinese firms expanding internationally, higher costs may persist due to limited access to affordable cloud options abroad. Alibaba Cloud and Baidu Smart Cloud have foreign networks but cannot match AWS in scale, while Huawei Cloud is described as less capable of competing on global compute power.
Overall, the 2026 AI cloud price surge is presented as the first direct experience of AI-driven inflation for many users, tied to resource competition and supply-demand imbalance in compute economics. The article concludes that the phenomenon is unlikely to end quickly and is intertwined with broader geopolitical and economic dynamics.
Premium gym chains are entering a “golden era” that is ending or already in decline, as rising operating costs collide with shifting consumer preferences toward more flexible, community-based ways to exercise. Long-term memberships are shrinking, margins are pressured by higher rents and facility expenses, and competition from smaller, more personalized…