Related Models
Pricing
Input
$1.25
per 1M tokens
Output
$10.00
per 1M tokens
Blended
$3.44
per 1M tokens
Cheaper than 17% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$3.44
Monthly
$103.14
vs. Similar Models
Grok 4.20 0309 v2 (Non-reasoning)Q:0.0
$3.00-13%
Gemma 4 12B (Reasoning)Q:+0.2
$0.15-96%
Grok Code Fast 1Q:-0.2
$0.53-85%
HyperNova 60B 2605Q:+0.3
$0.07-98%
Performance
127
tokens/sec
Faster than 64% of models
16.76
seconds
Faster than 7% of models
32.56
seconds
Faster than 15% of models
Market Median
94 tok/s
34% faster
Median TTFT
1.11s
1406% slower
Throughput/Dollar
37
tok/s per $/1M
Speed Comparison
OpenAI: GPT-4o (2024-05-13)
126 tok/s-0%
GPT-4o (Aug '24)
128 tok/s+1%
Qwen3 VL 30B A3B (Reasoning)
124 tok/s-2%
Benchmarks
MMLU-Pro
83.0%
GPQA Diamond
78.5%
HLE
8.9%
LiveCodeBench
73.0%
SciCode
42.7%
TerminalBench Hard
24.2%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
89.0%
IFBench
79.0%
Long Context Recall
54.3%
Tau2
92.7%
Market AverageTop Score