Qwen3 8B (Reasoning)

Alibaba · Released 2025-04-28
Open Source

Pricing

Input: $0.11 per 1M tokens
Output: $1.15 per 1M tokens
Blended: $0.37 per 1M tokens

Cheaper than 58% of models. Median price is $0.54/1M tokens.
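
The listing does not say how the blended price is derived. The figures are consistent with a weighted average that counts input tokens three times as heavily as output tokens. The sketch below checks that arithmetic; the 3:1 weighting and the function name `blended_price` are assumptions for illustration, not something the page confirms.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price in USD per 1M tokens.

    The 3:1 input:output default is an assumption; the listing
    does not state which ratio it uses.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices for Qwen3 8B (Reasoning), in USD per 1M tokens.
blended = blended_price(0.11, 1.15)
print(f"${blended:.2f} per 1M tokens")  # prints "$0.37 per 1M tokens"
```

A workload with a different input-to-output mix can pass its own weights, for example `blended_price(0.11, 1.15, input_weight=1, output_weight=1)` for an even split.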

Cost Calculator

Tokens per day: 1M (range: 100K to 100M)

Daily: $0.37
Monthly: $11.10
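
The calculator figures can be reproduced by multiplying the daily token volume by the blended price and scaling to a month. A minimal sketch, assuming a 30-day month (which matches $0.37 × 30 = $11.10); the function name `estimate_cost` is hypothetical.

```python
from typing import Tuple


def estimate_cost(tokens_per_day: int, price_per_million: float,
                  days_per_month: int = 30) -> Tuple[float, float]:
    """Return (daily, monthly) cost in USD.

    The 30-day month is an assumption inferred from the listed figures.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    return daily, daily * days_per_month


# 1M tokens per day at the blended price of $0.37 per 1M tokens.
daily, monthly = estimate_cost(1_000_000, 0.37)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")  # Daily: $0.37, Monthly: $11.10
```

At the upper end of the calculator range, `estimate_cost(100_000_000, 0.37)` gives a daily cost of $37.00.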

vs. Similar Models

Gemma 3 27B Instruct (Q: 0.0): $0.14 (-61%)
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) (Q: 0.0): $0.09 (-76%)
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) (Q: 0.0): $0.09 (-77%)
Mistral Large 2407 (Q: -0.1): $3.00 (+711%)

Performance

61 tokens/sec (faster than 29% of models)
1.51 seconds (faster than 34% of models)
34.53 seconds (faster than 14% of models)

Market Median: 94 tok/s (35% slower)
Median TTFT: 1.10s (37% slower)
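
The "slower" percentages appear to be relative differences against the market median. The sketch below assumes that formula (the page does not state it) and uses a hypothetical helper, `percent_slower`. The 37% result comes from comparing the 1.51-second figure listed under Performance with the 1.10s median, which suggests that figure is the time to first token.

```python
def percent_slower(value: float, median: float, higher_is_better: bool) -> float:
    """Percent by which `value` is worse than `median`.

    For throughput, higher is better; for latency, lower is better.
    """
    if higher_is_better:
        return (median - value) / median * 100
    return (value - median) / median * 100


throughput_gap = percent_slower(61, 94, higher_is_better=True)   # tok/s
ttft_gap = percent_slower(1.51, 1.10, higher_is_better=False)    # seconds
print(f"{throughput_gap:.0f}% slower, {ttft_gap:.0f}% slower")   # 35% slower, 37% slower
```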

Throughput/Dollar: 164 tok/s per $/1M
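
The unit suggests this metric is output speed divided by price per 1M tokens. A minimal sketch under that assumption, using the listed 61 tok/s and the $0.37 blended price; the function name `throughput_per_dollar` is hypothetical. The rounded inputs give roughly 164.9, so the listed 164 presumably reflects truncation or unrounded source values.

```python
def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Output speed (tok/s) per dollar of price per 1M tokens.

    Assumed definition; the listing only gives the unit "tok/s per $/1M".
    """
    if price_per_million <= 0:
        raise ValueError("price_per_million must be positive")
    return tokens_per_sec / price_per_million


value = throughput_per_dollar(61, 0.37)
print(f"{value:.1f} tok/s per $/1M")  # prints "164.9 tok/s per $/1M"
```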

Speed Comparison

Gemma 4 E4B (Reasoning): 60 tok/s (-1%)
Jamba 1.7 Large: 61 tok/s (+1%)
Jamba 1.6 Large: 61 tok/s (+1%)

Benchmarks

MMLU-Pro: 74.3%
GPQA Diamond: 58.9%
HLE: 4.2%
LiveCodeBench: 40.6%
SciCode: 22.6%
TerminalBench Hard: 2.3%
MATH-500: 90.4%
AIME: 74.7%
AIME 2025: 19.0%
IFBench: 33.5%
Long Context Recall: 0.0%
Tau2: 27.8%