Related Models
Pricing
Input
$0.10
per 1M tokens
Output
$0.23
per 1M tokens
Blended
$0.14
per 1M tokens
Cheaper than 78% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.14
Monthly
$4.14
vs. Similar Models
Llama 3.3 Instruct 70BQ:0.0
$0.61+343%
OpenAI: GPT-4o (2024-05-13)Q:0.0
$7.50+5335%
Qwen3 32B (Non-reasoning)Q:0.0
$0.26+88%
Llama Nemotron Super 49B v1.5 (Non-reasoning)Q:+0.1
$0.17+27%
Performance
151
tokens/sec
Faster than 75% of models
0.53
seconds
Faster than 85% of models
0.53
seconds
Faster than 89% of models
Market Median
94 tok/s
60% faster
Median TTFT
1.11s
52% faster
Throughput/Dollar
1095
tok/s per $/1M
Speed Comparison
Mistral Small (Feb '24)
151 tok/s+0%
Apriel-v1.5-15B-Thinker
151 tok/s-0%
xAI: Grok 4.3
150 tok/s-0%
Benchmarks
MMLU-Pro
65.9%
GPQA Diamond
45.4%
HLE
4.8%
LiveCodeBench
21.2%
SciCode
26.5%
TerminalBench Hard
7.6%
MATH-500
70.7%
AIME
9.3%
AIME 2025
3.7%
IFBench
29.9%
Long Context Recall
19.7%
Tau2
25.1%
Market AverageTop Score