Pricing
Input: $0.24 per 1M tokens
Output: $0.24 per 1M tokens
Blended: $0.24 per 1M tokens
Cheaper than 67% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day: 1M (range: 100K to 100M)
Daily: $0.24
Monthly: $7.35
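The calculator's arithmetic can be sketched as follows. This is a minimal sketch, not the page's actual implementation: the function names `daily_cost` and `monthly_cost` and the 30-day month are assumptions. At the displayed $0.24 price a 30-day month comes to $7.20; the $7.35 shown would be consistent with an unrounded blended price near $0.245, which is also an assumption.

```python
def daily_cost(tokens_per_day: float, price_per_million: float) -> float:
    """Cost in dollars for one day of usage at a per-1M-token price."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(tokens_per_day: float, price_per_million: float,
                 days_per_month: int = 30) -> float:
    """Cost in dollars for a month; the 30-day month is an assumption."""
    return daily_cost(tokens_per_day, price_per_million) * days_per_month


if __name__ == "__main__":
    # 1M tokens per day at the displayed blended price of $0.24 per 1M tokens.
    print(f"daily:   ${daily_cost(1_000_000, 0.24):.2f}")
    print(f"monthly: ${monthly_cost(1_000_000, 0.24):.2f}")
    # Hypothetical unrounded price that would reproduce the displayed $7.35.
    print(f"monthly: ${monthly_cost(1_000_000, 0.245):.2f}")
```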
vs. Similar Models
Gemma 3 12B Instruct (Q: +0.1), $0.14 (-43%)
Llama 3 Instruct 70B (Q: +0.2), $1.18 (+380%)
Microsoft: Phi 4 Mini Instruct (Q: -0.3), $0.15 (-40%)
Command-R+ (Apr '24) (Q: -0.3), $6.00 (+2349%)
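The percentages beside each price read as a relative price difference against this model's blended price. A minimal sketch of that calculation follows, assuming that definition; `price_delta_percent` is a hypothetical helper name. Results computed from the rounded prices shown here differ by a point or two from the listed percentages, presumably because the page works from unrounded prices.

```python
def price_delta_percent(other_price: float, reference_price: float) -> float:
    """Percent by which other_price is above (+) or below (-) reference_price."""
    return (other_price - reference_price) / reference_price * 100


if __name__ == "__main__":
    reference = 0.24  # displayed blended price, $ per 1M tokens
    similar_models = {
        "Gemma 3 12B Instruct": 0.14,
        "Llama 3 Instruct 70B": 1.18,
        "Microsoft: Phi 4 Mini Instruct": 0.15,
        "Command-R+ (Apr '24)": 6.00,
    }
    for name, price in similar_models.items():
        delta = price_delta_percent(price, reference)
        print(f"{name}: ${price:.2f} ({delta:+.0f}%)")
```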
Performance
110 tokens/sec (faster than 58% of models)
0.51 seconds (faster than 86% of models)
0.51 seconds (faster than 91% of models)
Market Median: 94 tok/s (16% faster)
Median TTFT: 1.11s (54% faster)
Throughput/Dollar: 447 tok/s per $/1M
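The comparison figures above can be approximated with three small formulas. This is a sketch under assumed definitions inferred from the labels, and the function names are hypothetical: "faster" for throughput is the ratio to the median minus one, "faster" for TTFT is the fractional reduction in seconds, and throughput per dollar is tokens/sec divided by the price per 1M tokens. Using the rounded values displayed here (110 tok/s, $0.24), the throughput results come out slightly different from the listed 16% and 447, which suggests the page uses unrounded inputs.

```python
def percent_faster_throughput(model_tps: float, median_tps: float) -> float:
    """How much higher the model's tokens/sec is than the median, in percent."""
    return (model_tps / median_tps - 1) * 100


def percent_faster_latency(model_seconds: float, median_seconds: float) -> float:
    """How much shorter the model's latency is than the median, in percent."""
    return (1 - model_seconds / median_seconds) * 100


def throughput_per_dollar(model_tps: float, price_per_million: float) -> float:
    """Tokens/sec per dollar of price per 1M tokens."""
    return model_tps / price_per_million


if __name__ == "__main__":
    print(f"throughput vs. median: {percent_faster_throughput(110, 94):.1f}% faster")
    print(f"TTFT vs. median:       {percent_faster_latency(0.51, 1.11):.1f}% faster")
    print(f"throughput per dollar: {throughput_per_dollar(110, 0.24):.0f}")
```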
Speed Comparison
Meta: Llama 4 Scout, 109 tok/s (-0%)
GPT-5.4 (low), 109 tok/s (-0%)
Kwaipilot: KAT-Coder-Pro V2, 109 tok/s (-0%)
Benchmarks
MMLU-Pro: 46.4%
GPQA Diamond: 22.1%
HLE: 5.2%
LiveCodeBench: 11.0%
SciCode: 11.2%
TerminalBench Hard: 0.8%
MATH-500: 51.6%
AIME: 9.3%
AIME 2025: 1.7%
IFBench: 30.4%
Long Context Recall: 11.7%
Tau2: 14.6%