QwQ 32B-Preview

Alibaba · Released 2024-11-27
Open Source

Pricing

Input: $0.12 per 1M tokens
Output: $0.18 per 1M tokens
Blended: $0.14 per 1M tokens

Cheaper than 79% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day: 1M (adjustable from 100K to 100M)
Daily: $0.14
Monthly: $4.05
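The calculator figures can be reproduced with a few lines of arithmetic. The sketch below assumes the blended price weights input and output tokens 3:1 and that a month is 30 days; the page states neither, but both are consistent with the displayed values ($0.135 rounds to the $0.14 shown, and 30 days at that rate gives $4.05). The function names are illustrative.

```python
INPUT_PRICE = 0.12   # USD per 1M input tokens (from the pricing figures)
OUTPUT_PRICE = 0.18  # USD per 1M output tokens (from the pricing figures)


def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption, not stated on the page.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


def cost(tokens_per_day, price_per_million, days=1):
    """Cost in USD for a given daily token volume over a number of days."""
    return tokens_per_day / 1_000_000 * price_per_million * days


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
daily = cost(1_000_000, blended)
monthly = cost(1_000_000, blended, days=30)  # 30-day month is an assumption

print(f"blended ${blended:.3f}/1M, daily ${daily:.3f}, monthly ${monthly:.2f}")
```

Changing `tokens_per_day` covers the rest of the slider range, for example `cost(100_000_000, blended, days=30)` for the 100M setting.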

vs. Similar Models

GLM-4.5V (Reasoning): Q: -0.1, $0.90 (+567%)
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning): Q: -0.1, $0.90 (+567%)
Mistral Large 2 (Nov '24): Q: -0.1, $3.00 (+2122%)
Mistral Small 3.2: Q: -0.1, $0.13 (-5%)
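The percentage next to each price appears to be the difference relative to this model's blended price. A minimal sketch, assuming the baseline is an unrounded blended price of $0.135 per 1M tokens (an assumption; the page displays $0.14) and using a hypothetical helper `pct_diff`:

```python
BASE_PRICE = 0.135  # assumed unrounded blended price, USD per 1M tokens


def pct_diff(other_price, base_price=BASE_PRICE):
    """Percentage by which other_price differs from base_price."""
    return (other_price - base_price) / base_price * 100


# Prices as displayed in the comparison list (USD per 1M tokens).
similar_models = [
    ("GLM-4.5V (Reasoning)", 0.90),
    ("Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)", 0.90),
    ("Mistral Large 2 (Nov '24)", 3.00),
    ("Mistral Small 3.2", 0.13),
]

for name, price in similar_models:
    print(f"{name}: ${price:.2f} ({pct_diff(price):+.0f}%)")
```

Under this assumption the first three rows match the listed +567% and +2122%. The last row comes out near -4% rather than the listed -5%, presumably because the displayed $0.13 is itself rounded.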

Performance

43 tokens/sec (faster than 10% of models)
0.46 seconds (faster than 91% of models)
46.70 seconds (faster than 8% of models)

Market median speed: 94 tok/s (this model is 54% slower)
Median TTFT: 1.11s (this model is 59% faster)
Throughput/Dollar: 320 tok/s per $/1M
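The relative figures above follow from comparing this model's measurements with the market medians. The sketch below also estimates throughput per dollar as output speed divided by blended price, which is an interpretation of the unit "tok/s per $/1M" rather than a definition given on the page; the $0.135 blended price is likewise an assumption.

```python
def relative_change(value, median):
    """Percentage difference of value from median (negative means lower)."""
    return (value - median) / median * 100


speed_tok_s = 43         # output speed, tokens/sec
median_speed_tok_s = 94  # market median speed, tokens/sec
ttft_s = 0.46            # time to first token, seconds
median_ttft_s = 1.11     # median time to first token, seconds
blended_price = 0.135    # assumed unrounded blended price, USD per 1M tokens

speed_delta = relative_change(speed_tok_s, median_speed_tok_s)
ttft_delta = relative_change(ttft_s, median_ttft_s)
throughput_per_dollar = speed_tok_s / blended_price

print(f"speed vs median: {speed_delta:+.0f}%")
print(f"TTFT vs median: {ttft_delta:+.0f}%")
print(f"throughput per dollar: {throughput_per_dollar:.0f}")
```

A lower speed is worse while a lower TTFT is better, which is why the same sign reads as "slower" in one case and "faster" in the other. The throughput estimate lands slightly below the listed 320, which would be consistent with the displayed speed being rounded.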

Speed Comparison

DeepSeek R1 Distill Qwen 32B: 43 tok/s (-1%)
Qwen: Qwen3 Max Thinking: 44 tok/s (+2%)
Claude Opus 4.7 (Non-reasoning, High Effort): 42 tok/s (-3%)

Benchmarks

MMLU-Pro: 64.8%
GPQA Diamond: 55.7%
HLE: 4.8%
LiveCodeBench: 33.7%
SciCode: 3.8%
TerminalBench Hard: Not evaluated
MATH-500: 91.0%
AIME: 45.3%
AIME 2025: Not evaluated
IFBench: Not evaluated
Long Context Recall: Not evaluated
Tau2: Not evaluated