QwQ 32B

Alibaba·Released 2025-03-05

Open Source131K ctx

About

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...

Related Models

QwQ 32B-Preview2024-11-27

Quality Index

13.4

272nd of 537

Top 51%

Math Index

29.0

188th of 269

Top 70%

Price/1M

$0.74

377th cheapest

37% above median

Top 55%

Speed

32 tok/s

Top 98%

TTFT

0.47s

Context Window

131K

236th largest

Top 73%

Market Position

QwQ 32BMarket Average

Pricing

Input

$0.66

per 1M tokens

Output

$1.00

per 1M tokens

Blended

$0.74

per 1M tokens

Cheaper than 45% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.74

Monthly

$22.35

vs. Similar Models

Qwen3 235B A22B (Reasoning)Q:0.0

$2.63+252%

Qwen3 VL 30B A3B (Reasoning)Q:-0.1

$0.34-55%

Qwen: Qwen3 Coder 30B A3B InstructQ:+0.2

$0.12-84%

Gemma 4 12B (Non-reasoning)Q:-0.2

$0.15-80%

Performance

tokens/sec

Faster than 2% of models

0.47

seconds

Faster than 90% of models

77.77

seconds

Faster than 2% of models

Market Median

94 tok/s

66% slower

Median TTFT

1.11s

58% faster

Throughput/Dollar

tok/s per $/1M

Speed Comparison

Qwen3.5 0.8B (Non-reasoning)

30 tok/s-6%

OpenAI: o3 Pro

34 tok/s+6%

Gemma 3 4B Instruct

34 tok/s+6%

Context Window

131K

tokens

Larger than 27% of models

Max Output

131K

tokens

100% of context

Benchmarks

MMLU-Pro

76.4%

GPQA Diamond

59.3%

HLE

8.2%

LiveCodeBench

63.1%

SciCode

35.8%

TerminalBench HardNot evaluated

MATH-500

95.7%

AIME

78.0%

AIME 2025

29.0%

IFBench

38.8%

Long Context Recall

25.0%

Tau2Not evaluated

Market AverageTop Score

Open Source

Quick Compare

Similar Models

Qwen3 235B A22B (Reasoning)

Alibaba

Q: 13.4$2.63/1M

Faster: 105%Pricier: 252%

Gemini 2.0 Flash Thinking Experimental (Jan '25)

Google

Q: 13.3N/A/1M

Qwen3 VL 30B A3B (Reasoning)

Alibaba

Q: 13.3$0.34/1M

Faster: 286%Cheaper: 55%

Qwen: Qwen3 Coder 30B A3B Instruct

Alibaba

Q: 13.6$0.12/1M160K ctx

Faster: 244%Cheaper: 84%

Tri-21B-think Preview

Trillion Labs

Q: 13.6N/A/1M

GPT-4.5 (Preview)

OpenAI

Q: 13.6N/A/1M

Compare all 7 models