QwQ 32B — Alibaba | FindLLM

QwQ 32B

Alibaba·Released 2025-03-05

Open Source131K ctx

About

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...

Related Models

QwQ 32B-Preview2024-11-27

Quality Index

19.7

247th of 507

Top 49%

Math Index

29.0

188th of 269

Top 70%

Price/1M

$0.74

350th cheapest

33% above median

Top 55%

Speed

32 tok/s

Top 96%

TTFT

0.41s

Context Window

131K

201st largest

Top 67%

Market Position

QwQ 32BMarket Average

Pricing

Input

$0.66

per 1M tokens

Output

$1.00

per 1M tokens

Blended

$0.74

per 1M tokens

Cheaper than 45% of models. Median price is $0.56/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.74

Monthly

$22.35

vs. Similar Models

Qwen3 VL 30B A3B (Reasoning)Q:0.0

$0.34-55%

Qwen3 235B A22B (Reasoning)Q:+0.1

$2.63+252%

Qwen3 Coder 30B A3B InstructQ:+0.3

$0.12-84%

Google: Gemini 2.5 Flash Lite Preview 09-2025Q:-0.3

$0.17-77%

Performance

tokens/sec

Faster than 4% of models

0.41

seconds

Faster than 91% of models

79.32

seconds

Faster than 3% of models

Market Median

86 tok/s

63% slower

Median TTFT

1.07s

62% faster

Throughput/Dollar

tok/s per $/1M

Speed Comparison

Nous: Hermes 3 70B Instruct

32 tok/s-0%

GLM-4.6V (Reasoning)

31 tok/s-0%

OpenAI: GPT-4 Turbo

32 tok/s+2%

Context Window

131K

tokens

Larger than 33% of models

Max Output

131K

tokens

100% of context

Benchmarks

MMLU-Pro

76.4%

GPQA Diamond

59.3%

HLE

8.2%

LiveCodeBench

63.1%

SciCode

35.8%

TerminalBench HardNot evaluated

MATH-500

95.7%

AIME

78.0%

AIME 2025

29.0%

IFBench

38.8%

Long Context Recall

25.0%

Tau2Not evaluated

Market AverageTop Score

Open Source

Quick Compare

Similar Models

Qwen3 VL 30B A3B (Reasoning)

Alibaba

Q: 19.7$0.34/1M

Faster: 300%Cheaper: 55%

Gemini 2.0 Flash Thinking Experimental (Jan '25)

Google

Q: 19.6N/A/1M

Qwen3 235B A22B (Reasoning)

Alibaba

Q: 19.8$2.63/1M

Faster: 117%Pricier: 252%

Devstral Small 2

Mistral

Q: 19.5N/A/1M

Faster: 91%

Google: Gemini 2.5 Flash Lite Preview 09-2025

Google

Q: 19.4$0.17/1M1.0M ctx

Faster: 1020%Cheaper: 77%

Qwen3 Coder 30B A3B Instruct

Alibaba

Q: 20.0$0.12/1M160K ctx

Faster: 259%Cheaper: 84%

Compare all 7 models