Pricing
Input: $0.11 per 1M tokens
Output: $1.15 per 1M tokens
Blended: $0.37 per 1M tokens
Cheaper than 58% of models. Median price is $0.54/1M tokens.
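The page does not say how the blended price is derived. A minimal sketch, assuming a 3:1 input-to-output token weighting (an assumption, chosen because it reproduces the $0.37 figure from the listed input and output prices); the function name `blended_price` is hypothetical:

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption, not stated by the page.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Prices per 1M tokens as listed in the Pricing section.
print(f"${blended_price(0.11, 1.15):.2f} per 1M tokens")
```

A workload with a different input-to-output mix would need different weights, so the blended figure is only a rough guide.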
Cost Calculator
Tokens per day: 1M (range: 100K to 100M)
Daily: $0.37
Monthly: $11.10
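The calculator figures are consistent with multiplying daily token volume by the blended price and treating a month as 30 days. A small sketch under those assumptions (the 30-day month and the name `token_cost` are illustrative, not taken from the page):

```python
def token_cost(tokens_per_day: int, price_per_million: float,
               days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in dollars.

    Assumes a flat blended price and a 30-day month.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    return daily, daily * days_per_month


daily, monthly = token_cost(1_000_000, 0.37)
print(f"Daily: ${daily:.2f}  Monthly: ${monthly:.2f}")
```

Changing `tokens_per_day` anywhere within the calculator's 100K to 100M range scales both figures linearly.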
vs. Similar Models
Gemma 3 27B Instruct (Q: 0.0): $0.14, -61%
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) (Q: 0.0): $0.09, -76%
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) (Q: 0.0): $0.09, -77%
Mistral Large 2407 (Q: -0.1): $3.00, +711%
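The percentages appear to express each similar model's price relative to this model's $0.37 blended price; that reading is inferred from the numbers rather than stated. A sketch of the calculation, with `percent_diff` as a hypothetical helper:

```python
def percent_diff(other_price: float, reference_price: float) -> float:
    """Percentage by which other_price differs from reference_price."""
    return (other_price - reference_price) / reference_price * 100


REFERENCE = 0.37  # this model's blended price per 1M tokens

similar = [
    ("Gemma 3 27B Instruct", 0.14),
    ("NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)", 0.09),
    ("NVIDIA Nemotron Nano 9B V2 (Non-reasoning)", 0.09),
    ("Mistral Large 2407", 3.00),
]

for name, price in similar:
    print(f"{name}: ${price:.2f} ({percent_diff(price, REFERENCE):+.0f}%)")
```

Results computed from the rounded prices can differ from the displayed figures by a point. The two $0.09 entries showing -76% and -77% suggest the page works from unrounded prices.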
Performance
61 tokens/sec (faster than 29% of models)
1.51 seconds (faster than 34% of models)
34.53 seconds (faster than 14% of models)
Market Median: 94 tok/s (this model is 35% slower)
Median TTFT: 1.10s (this model is 37% slower)
Throughput/Dollar: 164 tok/s per $/1M
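The median comparisons and the throughput-per-dollar figure can be approximated from the numbers on this page. The sketch below assumes that the 1.51-second figure is the one compared against the median TTFT, and that throughput per dollar is tokens per second divided by the blended price; both are inferences, and the function names are hypothetical:

```python
def pct_slower_throughput(value: float, median: float) -> float:
    """How much lower a throughput is than the median, in percent."""
    return (1 - value / median) * 100


def pct_slower_latency(value: float, median: float) -> float:
    """How much higher a latency is than the median, in percent."""
    return (value / median - 1) * 100


def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens per second per dollar of blended price (per 1M tokens)."""
    return tokens_per_sec / price_per_million


print(f"Throughput vs. median: {pct_slower_throughput(61, 94):.0f}% slower")
print(f"Latency vs. median TTFT: {pct_slower_latency(1.51, 1.10):.0f}% slower")
print(f"Throughput/Dollar: {throughput_per_dollar(61, 0.37):.1f} tok/s per $/1M")
```

With the rounded inputs the last value comes out slightly above the displayed 164, which suggests the page truncates or uses unrounded inputs.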
Speed Comparison
Gemma 4 E4B (Reasoning): 60 tok/s, -1%
Jamba 1.7 Large: 61 tok/s, +1%
Jamba 1.6 Large: 61 tok/s, +1%
Benchmarks
MMLU-Pro: 74.3%
GPQA Diamond: 58.9%
HLE: 4.2%
LiveCodeBench: 40.6%
SciCode: 22.6%
TerminalBench Hard: 2.3%
MATH-500: 90.4%
AIME: 74.7%
AIME 2025: 19.0%
IFBench: 33.5%
Long Context Recall: 0.0%
Tau2: 27.8%