Pricing
Input: $0.28 per 1M tokens
Output: $1.85 per 1M tokens
Blended: $0.67 per 1M tokens
Cheaper than 47% of models. Median price is $0.54/1M tokens.
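The page does not say how the blended price is weighted. As a rough sketch, a 3:1 input-to-output token ratio (an assumption, not something stated here) reproduces the listed $0.67 after rounding. The function name and the default weights below are illustrative only.

```python
# Listed per-1M-token prices from the pricing figures above (USD).
INPUT_PRICE = 0.28
OUTPUT_PRICE = 1.85


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens.

    The 3:1 input:output default is an assumption; the page does not
    state which weighting it uses.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended: ${blended:.2f} per 1M tokens")
```

A workload that generates more output than this ratio assumes will see a higher effective price, since output tokens cost roughly 6.6 times as much as input tokens here.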
Cost Calculator
Tokens per day: 1M (adjustable from 100K to 100M)
Daily: $0.67
Monthly: $20.19
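The calculator's arithmetic can be approximated as follows. This is a minimal sketch that assumes the blended price applies to every token and that a month is 30 days; neither assumption is confirmed by the page, and the function names are hypothetical.

```python
BLENDED_PRICE_PER_1M = 0.67  # listed blended price, USD per 1M tokens


def daily_cost(tokens_per_day, price_per_1m=BLENDED_PRICE_PER_1M):
    """Cost in USD for one day's tokens at a flat per-1M-token price."""
    return tokens_per_day / 1_000_000 * price_per_1m


def monthly_cost(tokens_per_day, price_per_1m=BLENDED_PRICE_PER_1M, days=30):
    """Cost in USD for a month; days=30 is an assumption."""
    return daily_cost(tokens_per_day, price_per_1m) * days


# The calculator's slider endpoints and its default value.
for tokens in (100_000, 1_000_000, 100_000_000):
    print(f"{tokens:>11,} tokens/day: "
          f"${daily_cost(tokens):.2f} daily, ${monthly_cost(tokens):.2f} monthly")
```

With the rounded $0.67 price this gives $20.10 per month at 1M tokens per day, slightly below the listed $20.19. The gap suggests the page works from an unrounded price or a different month length, but the source does not say which.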
vs. Similar Models
Mistral Large 3 (Q: +0.1), $0.75 (+11%)
DeepSeek V3 0324 (Q: -0.1), $1.21 (+80%)
Qwen3.5 4B (Non-reasoning) (Q: +0.2), $0.06 (-91%)
INTELLECT-3 (Q: -0.2), $0.43 (-37%)
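The percentages next to each price appear to be relative differences against this model's blended price. A short sketch of that calculation, assuming the $0.67 blended figure is the reference (the helper name is hypothetical):

```python
REFERENCE_PRICE = 0.67  # this model's listed blended price, USD per 1M tokens

# Prices as displayed on the page (already rounded to cents).
similar_models = {
    "Mistral Large 3": 0.75,
    "DeepSeek V3 0324": 1.21,
    "Qwen3.5 4B (Non-reasoning)": 0.06,
    "INTELLECT-3": 0.43,
}


def price_delta_percent(other_price, reference_price=REFERENCE_PRICE):
    """Signed percent difference of other_price relative to reference_price."""
    return (other_price - reference_price) / reference_price * 100


for name, price in similar_models.items():
    print(f"{name}: ${price:.2f} ({price_delta_percent(price):+.1f}%)")
```

Because the displayed prices are rounded, the results can differ from the listed percentages by about a point.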
Performance
Output speed: 146 tokens/sec (faster than 72% of models)
Time to first token (TTFT): 1.04 seconds (faster than 53% of models)
14.76 seconds (faster than 33% of models)
Market median: 94 tok/s (this model is 55% faster)
Median TTFT: 1.11s (this model is 6% faster)
Throughput/Dollar: 217 tok/s per $/1M
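The derived figures above can be reproduced approximately from the raw numbers. The formulas below are assumptions chosen because they match the listed values; the page does not document its definitions, and the function names are hypothetical.

```python
def percent_faster_throughput(model_tps, median_tps):
    """How much higher the model's tokens/sec is than the median, in percent."""
    return (model_tps / median_tps - 1) * 100


def percent_faster_latency(model_seconds, median_seconds):
    """How much lower the model's latency is than the median, in percent.

    Assumed definition: time saved as a share of the median latency.
    """
    return (median_seconds - model_seconds) / median_seconds * 100


def throughput_per_dollar(tps, price_per_1m):
    """Tokens/sec divided by the blended price in USD per 1M tokens."""
    return tps / price_per_1m


print(f"Throughput vs. median: {percent_faster_throughput(146, 94):.1f}% faster")
print(f"TTFT vs. median: {percent_faster_latency(1.04, 1.11):.1f}% faster")
print(f"Throughput/Dollar: {throughput_per_dollar(146, 0.67):.1f} tok/s per $/1M")
```

Using the rounded $0.67 blended price gives a throughput-per-dollar value a little under 218, close to the listed 217, which is consistent with the page using an unrounded price.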
Speed Comparison
Mistral Small 3, 146 tok/s (+0%)
OpenAI: o3, 145 tok/s (-0%)
NVIDIA Nemotron Nano 9B V2 (Non-reasoning), 145 tok/s (-1%)
Benchmarks
MMLU-Pro: 80.5%
GPQA Diamond: 70.7%
HLE: 9.8%
LiveCodeBench: 70.7%
SciCode: 33.3%
TerminalBench Hard: 5.3%
MATH-500: 97.6%
AIME: 90.7%
AIME 2025: 56.3%
IFBench: 50.7%
Long Context Recall: 59.0%
Tau2: 28.1%