Qwen: Qwen3 Max Thinking

Alibaba·Released 2026-02-09

262K ctx

About

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...

Quality Index

31.7

92nd of 537

Top 17%

Coding Index

30.5

144th of 447

Top 32%

Price/1M

$1.56

490th cheapest

187% above median

Top 72%

Speed

44 tok/s

Top 90%

TTFT

1.45s

Context Window

262K

110th largest

Top 38%

Market Position

Qwen: Qwen3 Max ThinkingMarket Average

Pricing

Input

$0.78

per 1M tokens

Output

$3.90

per 1M tokens

Blended

$1.56

per 1M tokens

Cheaper than 28% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$1.56

Monthly

$46.80

vs. Similar Models

Qwen: Qwen3.6 35B A3BQ:-0.1

$0.35-77%

Qwen3.5 397B A17B (Non-reasoning)Q:+0.3

$1.35-13%

MiniMax: MiniMax M2.1Q:-0.3

$0.45-71%

DeepSeek V4 Pro (Non-reasoning)Q:-0.5

$0.54-65%

Performance

tokens/sec

Faster than 10% of models

1.45

seconds

Faster than 35% of models

46.83

seconds

Faster than 7% of models

Market Median

94 tok/s

53% slower

Median TTFT

1.11s

30% slower

Throughput/Dollar

tok/s per $/1M

Speed Comparison

MiMo-V2-Pro

44 tok/s+1%

QwQ 32B-Preview

43 tok/s-2%

DeepSeek R1 Distill Qwen 32B

43 tok/s-3%

Context Window

262K

tokens

Larger than 62% of models

Max Output

33K

tokens

13% of context

Benchmarks

MMLU-ProNot evaluated

GPQA Diamond

86.1%

HLE

26.2%

LiveCodeBenchNot evaluated

SciCode

43.1%

TerminalBench Hard

24.2%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025Not evaluated

IFBench

70.7%

Long Context Recall

66.0%

Tau2

83.6%

Market AverageTop Score

Quick Compare

Similar Models

Qwen: Qwen3.6 35B A3B

Alibaba

Q: 31.6$0.35/1M262K ctx

Faster: 234%Cheaper: 77%

MiniMax: MiniMax M2.1

MiniMax

Q: 31.4$0.45/1M205K ctx

Faster: 410%Cheaper: 71%

Qwen3.5 397B A17B (Non-reasoning)

Alibaba

Q: 32.0$1.35/1M

Faster: 20%Cheaper: 13%

GPT-5 (low)

OpenAI

Q: 31.2$3.44/1M

Faster: 80%Pricier: 120%

MiMo-V2-Flash (Reasoning)

Xiaomi

Q: 31.2$0.15/1M

Faster: 107%Cheaper: 90%

DeepSeek V4 Pro (Non-reasoning)

DeepSeek

Q: 31.2$0.54/1M

Faster: 90%Cheaper: 65%

Compare all 7 models

Qwen: Qwen3 Max Thinking

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Quick Compare

Similar Models

Market Position