Alibaba unveils new contender in AI language model

January 29, 2026 This week, Alibaba Cloud’s Qwen Team unveiled Qwen3-Max-Thinking, a proprietary language model designed to compete directly with top-tier reasoning systems from U.S. and European labs. The Chinese AI lab says the model can match and in some benchmarks, surpass the reasoning performance of OpenAI’s GPT-5.2 and Google’s Gemini 3 Pro, while running at a fraction of the cost.

The release lands at a pivotal moment. Western firms have largely defined what “reasoning models” look like, often described as System-2 style logic for multi-step problem solving. Qwen’s latest results suggest that gap has narrowed or, in some cases, closed.

Qwen is already a familiar name in AI circles. Over the past year, the team has shipped a wide range of open-source models spanning text, image, and speech, earning adoption well beyond China. The models even drew praise from Brian Chesky, who previously said Airbnb was using Qwen’s free, open models as a cost-effective alternative to U.S.-based offerings.

Unlike earlier releases, Qwen3-Max-Thinking is proprietary. According to the Qwen Team, the goal was not incremental improvement but architectural leverage.

At the core of the model is a technique the team calls “test-time scaling,” which allows the system to spend more compute on difficult problems while avoiding redundant reasoning. Instead of generating many answers and selecting the best one, Qwen3-Max-Thinking iteratively refines its reasoning, identifying dead ends early and focusing processing power on unresolved uncertainties.

Alibaba Cloud says this approach delivers measurable gains. On GPQA, a PhD-level science benchmark, the model improved its score from 90.3 to 92.8. On LiveCodeBench v6, performance rose from 88.0 to 91.4. In agentic tasks that combine reasoning with live web search, Qwen3-Max-Thinking also outperformed competing models on “Humanity’s Last Exam,” a benchmark designed to resist memorization.

Beyond raw reasoning, the model integrates autonomous tool use. It can decide when to search the web, run code, or retrieve stored context without explicit user prompts.

Pricing appears to be a key part of the strategy. Alibaba Cloud has positioned Qwen3-Max-Thinking below many flagship Western models on a per-token basis, while charging separately for advanced agent features such as web search and agent orchestration. Some tools, including code execution and web extraction, are currently free for a limited time.

Alibaba unveils new contender in AI language model

Top Stories

OpenAI brings in Smartly to shape how ads work inside ChatGPT

Quantum breakthrough researchers cut required qubits from millions to thousands

Anthropic coding tool exposes full source code again after packaging error

Cisco breach exposes 300+ repos after supply chain attack

OpenAI cuts Sora with $1M-a-day costs to free up resources for core AI models

AI boom faces new constraint as helium disruption slows chip production

Related Articles

NYC hospital chief says AI could take over some radiology work

OpenAI brings in Smartly to shape how ads work inside ChatGPT

Quantum breakthrough researchers cut required qubits from millions to thousands

AMD moves to acquire Intel in all-stock deal

Staff Writer

Staff Writer

Jim Love

Follow Us

Popular categories

Tech News Delivered