Pricing
Input
$0.04
per 1M tokens
Output
$0.17
per 1M tokens
Blended
$0.07
per 1M tokens
Cheaper than 87% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.07
Monthly
$2.22
vs. Similar Models
Magistral Small 1.2Q:0.0
$0.75+914%
Claude 3 OpusQ:-0.1
$30.00+40441%
Devstral Small (May '25)Q:-0.1
$0.07+1%
Nova 2.0 Lite (Non-reasoning)Q:-0.1
$0.85+1049%
Performance
147
tokens/sec
Faster than 72% of models
1.24
seconds
Faster than 42% of models
14.87
seconds
Faster than 33% of models
Market Median
94 tok/s
57% faster
Median TTFT
1.10s
12% slower
Throughput/Dollar
1983
tok/s per $/1M
Speed Comparison
xAI: Grok 4.3
146 tok/s-0%
Qwen: Qwen3.6 35B A3B
147 tok/s+0%
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
148 tok/s+1%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
73.8%
HLE
10.1%
LiveCodeBenchNot evaluated
SciCode
26.4%
TerminalBench Hard
1.5%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
34.4%
Long Context Recall
0.0%
Tau2
46.8%
Market AverageTop Score