Qwen3 VL 30B A3B (Reasoning)

Alibaba · Released 2025-10-03
Open Source

Pricing

Input: $0.20 per 1M tokens
Output: $0.75 per 1M tokens
Blended: $0.34 per 1M tokens

Cheaper than 60% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day: 1M (adjustable from 100K to 100M)
Daily: $0.34
Monthly: $10.14
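
The calculator figures above can be approximated with a few lines of arithmetic. The following is a minimal sketch, assuming the blended price is a 3:1 input-to-output weighted average. That ratio is not stated on the page; it is chosen here because it gives $0.3375, which rounds to the listed $0.34. The function names are hypothetical.

```python
INPUT_PRICE = 0.20   # USD per 1M input tokens (from the listing)
OUTPUT_PRICE = 0.75  # USD per 1M output tokens (from the listing)


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price; the 3:1 default ratio is an assumption."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


def cost_for_tokens(tokens, price_per_million):
    """Cost in USD for a token count at a given per-1M-token price."""
    return tokens / 1_000_000 * price_per_million


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
daily = cost_for_tokens(1_000_000, blended)
print(f"blended: ${blended:.4f} per 1M tokens")
print(f"daily at 1M tokens/day: ${daily:.2f}")
```

A monthly estimate is the daily cost multiplied by the number of days per month. The page does not state which day count it uses, so that value is left to the caller.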

vs. Similar Models

QwQ 32B · Q: +0.1 · $0.74 · +120%
Qwen3 235B A22B (Reasoning) · Q: +0.1 · $2.63 · +677%
Gemma 4 12B (Non-reasoning) · Q: -0.1 · $0.15 · -56%
Google: Gemini 2.5 Flash Lite Preview 09-2025 · Q: -0.2 · $0.17 · -48%

Performance

124 tokens/sec (faster than 63% of models)
1.20 seconds to first token (faster than 43% of models)
17.27 seconds (faster than 29% of models)

Market Median: 94 tok/s (32% faster)
Median TTFT: 1.11s (8% slower)
Throughput/Dollar: 368 tok/s per $/1M
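
The comparisons against the market median can be checked with a simple relative-difference calculation. This is a minimal sketch using the rounded numbers shown on this page. The function names are hypothetical, and the $0.3375 blended price is an assumed unrounded value (a 3:1 input-to-output weighting of $0.20 and $0.75), not a figure the page states.

```python
def percent_diff(value, baseline):
    """Relative difference of value versus baseline, in percent."""
    return (value - baseline) / baseline * 100


def throughput_per_dollar(tokens_per_sec, price_per_million):
    """Output speed divided by price per 1M tokens."""
    return tokens_per_sec / price_per_million


speed_vs_median = percent_diff(124, 94)      # output speed vs. market median
ttft_vs_median = percent_diff(1.20, 1.11)    # time to first token vs. median
tpd = throughput_per_dollar(124, 0.3375)     # assumed unrounded blended price

print(f"speed vs. median: {speed_vs_median:+.0f}%")
print(f"TTFT vs. median: {ttft_vs_median:+.0f}%")
print(f"throughput per dollar: {tpd:.0f}")
```

With these rounded inputs the throughput-per-dollar result lands near, but not exactly on, the listed 368. The page presumably computes it from unrounded measurements.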

Speed Comparison

inclusionAI: Ring-2.6-1T · 123 tok/s · -1%
OpenAI: GPT-4o (2024-05-13) · 126 tok/s · +1%
Nova 2.0 Pro Preview (medium) · 127 tok/s · +2%

Benchmarks

MMLU-Pro: 80.7%
GPQA Diamond: 72.0%
HLE: 8.7%
LiveCodeBench: 69.7%
SciCode: 28.8%
TerminalBench Hard: 5.3%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: 82.3%
IFBench: 45.1%
Long Context Recall: 40.7%
Tau2: 19.9%