QwQ 32B-Preview

Alibaba · Released 2024-11-27

Open Source

Related Models

QwQ 32B · 2025-03-05

Quality Index

9.2

345th of 537

Top 64%

Price/1M

$0.14

144th cheapest

75% below median

Top 21%

Speed

43 tok/s

Top 90%

TTFT

0.46s

Market Position

[Chart: QwQ 32B-Preview vs. Market Average]

Pricing

Input

$0.12

per 1M tokens

Output

$0.18

per 1M tokens

Blended

$0.14

per 1M tokens

Cheaper than 79% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day: 1M (slider range 100K to 100M)

Daily

$0.14

Monthly

$4.05
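The calculator figures above appear consistent with an unrounded blended price of $0.135 per 1M tokens (shown rounded as $0.14), which is what a 3:1 input-to-output weighting of the listed prices gives. The page does not state the weighting or the month length, so the 3:1 ratio and the 30-day month in this minimal sketch are assumptions, chosen because they reproduce the $4.05 monthly figure. The function names are illustrative.

```python
def blended_price(input_price, output_price, input_weight=3.0, output_weight=1.0):
    """Weighted average price in dollars per 1M tokens.

    The 3:1 default weighting is an assumption, not stated on the page.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


def usage_cost(tokens_per_day, price_per_million, days=1):
    """Dollar cost of processing tokens_per_day tokens for the given number of days."""
    return tokens_per_day / 1_000_000 * price_per_million * days


blended = blended_price(0.12, 0.18)                # input and output prices from the page
daily = usage_cost(1_000_000, blended)             # 1M tokens per day
monthly = usage_cost(1_000_000, blended, days=30)  # 30-day month is an assumption

print(f"blended: ${blended:.3f} per 1M tokens")
print(f"daily:   ${daily:.3f}")
print(f"monthly: ${monthly:.2f}")
```

Changing `tokens_per_day` to any value in the slider range (100K to 100M) scales both totals linearly.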

vs. Similar Models

GLM-4.5V (Reasoning) · Q: -0.1

$0.90 · +567%

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) · Q: -0.1

$0.90 · +567%

Mistral Large 2 (Nov '24) · Q: -0.1

$3.00 · +2122%

Mistral Small 3.2 · Q: -0.1

$0.13 · -5%
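The price differences above can be approximated as a signed percentage relative to this model's blended price. The sketch below assumes an unrounded base of $0.135 per 1M tokens (the page displays $0.14), since that value reproduces +567% and +2122%. The helper name is hypothetical.

```python
def percent_diff(other_price, base_price):
    """Signed percentage by which other_price differs from base_price."""
    return (other_price - base_price) / base_price * 100


BASE_PRICE = 0.135  # assumed unrounded blended price of QwQ 32B-Preview, $ per 1M tokens

similar_models = {
    "GLM-4.5V (Reasoning)": 0.90,
    "Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)": 0.90,
    "Mistral Large 2 (Nov '24)": 3.00,
    "Mistral Small 3.2": 0.13,
}

for name, price in similar_models.items():
    print(f"{name}: ${price:.2f} ({percent_diff(price, BASE_PRICE):+.0f}%)")
```

With the displayed $0.13, Mistral Small 3.2 comes out at roughly -4% rather than the listed -5%, presumably because the page computes from unrounded prices.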

Performance

Speed: 43 tokens/sec

Faster than 10% of models

TTFT: 0.46 seconds

Faster than 91% of models

46.70

seconds

Faster than 8% of models

Market Median

94 tok/s

QwQ 32B-Preview is 54% slower than the market median

Median TTFT

1.11s

QwQ 32B-Preview is 59% faster than the median TTFT
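Both median comparisons above can be read as the gap between this model's value and the median, expressed as a share of the median. A minimal sketch using the page's rounded figures follows. The function name is illustrative, and the page's exact formula is not stated.

```python
def percent_below(value, reference):
    """Percentage by which value falls below reference (negative if it is above)."""
    return (reference - value) / reference * 100


# Throughput: lower than the median means slower.
speed_gap = percent_below(43, 94)      # this model's tok/s vs. median tok/s

# Time to first token: lower than the median means faster.
ttft_gap = percent_below(0.46, 1.11)   # this model's TTFT vs. median TTFT, in seconds

print(f"throughput: {speed_gap:.0f}% below the median (slower)")
print(f"TTFT:       {ttft_gap:.0f}% below the median (faster)")
```

The same formula yields "slower" for throughput and "faster" for latency because a lower number is worse in the first case and better in the second.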

Throughput/Dollar

320

tok/s per $/1M
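Going by its unit, the throughput-per-dollar figure is output speed divided by price per 1M tokens. The sketch below assumes the unrounded blended price of $0.135. With the rounded 43 tok/s it lands near, but not exactly on, the listed 320, so the page probably uses unrounded inputs or rounds the result.

```python
def throughput_per_dollar(tokens_per_second, price_per_million):
    """Tokens per second obtained per dollar of price per 1M tokens."""
    return tokens_per_second / price_per_million


SPEED_TOK_S = 43       # rounded speed from the page
BLENDED_PRICE = 0.135  # assumed unrounded blended price, $ per 1M tokens

ratio = throughput_per_dollar(SPEED_TOK_S, BLENDED_PRICE)
print(f"{ratio:.1f} tok/s per $/1M")
```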

Speed Comparison

DeepSeek R1 Distill Qwen 32B

43 tok/s · -1%

Qwen: Qwen3 Max Thinking

44 tok/s · +2%

Claude Opus 4.7 (Non-reasoning, High Effort)

42 tok/s · -3%

Benchmarks

MMLU-Pro

64.8%

GPQA Diamond

55.7%

HLE

4.8%

LiveCodeBench

33.7%

SciCode

3.8%

TerminalBench Hard: Not evaluated

MATH-500

91.0%

AIME

45.3%

AIME 2025: Not evaluated

IFBench: Not evaluated

Long Context Recall: Not evaluated

Tau2: Not evaluated

Similar Models

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)

NVIDIA

Q: 9.1 · $0.90/1M

Faster: 20% · Pricier: 567%

Mistral Large 2 (Nov '24)

Mistral

Q: 9.1 · $3.00/1M

Faster: 29% · Pricier: 2122%

Mistral Small 3.2

Mistral

Q: 9.1 · $0.13/1M

Faster: 251%

GLM-4.5V (Reasoning)

Z AI

Q: 9.1 · $0.90/1M

Faster: 31% · Pricier: 567%

Qwen3 30B A3B 2507 Instruct

Alibaba

Q: 9.1 · $0.21/1M

Faster: 284% · Pricier: 58%

Devstral Small (Jul '25)

Mistral

Q: 9.3 · $0.15/1M · 131K ctx

Faster: 48% · Pricier: 11%
