Qwen3 VL 8B (Reasoning)

Alibaba · Released 2025-10-14 · Open Source

Pricing

Input: $0.18 per 1M tokens
Output: $2.10 per 1M tokens
Blended: $0.66 per 1M tokens

Cheaper than 47% of models. Median price is $0.54/1M tokens.
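
The blended figure is consistent with a 3:1 input-to-output token weighting, although the page does not state which ratio it uses. A minimal Python sketch under that assumption (the function name `blended_price` and the default weights are illustrative, not taken from the page):

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption; the page does not state its ratio.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Input and output prices per 1M tokens as listed on the page.
price = blended_price(0.18, 2.10)
print(f"${price:.2f} per 1M tokens")
```

A workload with a different input/output mix can pass its own weights, for example `blended_price(0.18, 2.10, input_weight=1, output_weight=1)` for an even split.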

Cost Calculator

Tokens per day: 1M (adjustable from 100K to 100M)
Daily: $0.66
Monthly: $19.80
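
The calculator's figures can be reproduced by scaling the blended price by daily token volume and multiplying by 30 days. A short Python sketch, assuming a 30-day month (inferred from $0.66 × 30 = $19.80) and using a hypothetical helper name `estimate_cost`:

```python
def estimate_cost(tokens_per_day: int, price_per_million: float,
                  days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in dollars.

    days_per_month=30 is an assumption inferred from the page's numbers.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    return daily, daily * days_per_month


# 1M tokens per day at the blended price of $0.66 per 1M tokens.
daily, monthly = estimate_cost(1_000_000, 0.66)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")
```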

vs. Similar Models

Nova 2.0 Omni (Non-reasoning) · Q: -0.1 · $0.85 (+29%)
Qwen3 32B (Reasoning) · Q: -0.1 · $0.28 (-58%)
DeepSeek V3 (Dec '24) · Q: -0.2 · $0.52 (-21%)
Qwen3 235B A22B (Non-reasoning) · Q: +0.3 · $0.79 (+19%)

Performance

135 tokens/sec · Faster than 67% of models
1.06 seconds · Faster than 53% of models
15.87 seconds · Faster than 30% of models

Market Median: 94 tok/s (43% faster)
Median TTFT: 1.11s (5% faster)

Throughput/Dollar: 205 tok/s per $/1M
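
Judging by its unit, the throughput-per-dollar figure looks like output speed divided by the blended price per 1M tokens. That reading is an inference rather than something the page states. A minimal Python sketch with a hypothetical function name:

```python
def throughput_per_dollar(tokens_per_sec: float, blended_price: float) -> float:
    """Output speed divided by blended price per 1M tokens.

    Assumed definition, inferred from the unit "tok/s per $/1M".
    """
    return tokens_per_sec / blended_price


# 135 tok/s and $0.66 blended price, both as listed on the page.
value = throughput_per_dollar(135, 0.66)
print(f"{value:.0f} tok/s per $/1M")
```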

Speed Comparison

Z.ai: GLM 5.2 · 135 tok/s (+0%)
Anthropic: Claude 3 Haiku · 134 tok/s (-1%)
OpenAI: GPT-4.1 · 134 tok/s (-1%)

Benchmarks

MMLU-Pro: 74.9%
GPQA Diamond: 57.9%
HLE: 3.3%
LiveCodeBench: 35.3%
SciCode: 21.9%
TerminalBench Hard: 3.8%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: 30.7%
IFBench: 39.9%
Long Context Recall: 31.0%
Tau2: 22.5%
