Loading...
Loading...
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.
Quality Index
19.7
191st of 444
Top 43%
Math Index
29.0
187th of 268
Top 70%
Price/1M
$0.74
431st cheapest
148% above median
Top 63%
Speed
32 tok/s
Top 57%
TTFT
0.45s
Context Window
131K
145th largest
Top 63%
Input
$0.66
per 1M tokens
Output
$1.00
per 1M tokens
Blended
$0.74
per 1M tokens
Cheaper than 37% of models. Median price is $0.30/1M tokens.
Daily
$0.74
Monthly
$22.35
32
tokens/sec
Faster than 43% of models
0.45
seconds
Faster than 48% of models
77.30
seconds
Faster than 1% of models
Market Median
45 tok/s
28% slower
Median TTFT
0.42s
9% slower
Throughput/Dollar
44
tok/s per $/1M
Speed Comparison
Context Window
131K
tokens
Larger than 37% of models
Max Output
131K
tokens
100% of context