Pricing
Input
$0.20
per 1M tokens
Output
$1.25
per 1M tokens
Blended
$0.46
per 1M tokens
Cheaper than 52% of models. Median price is $0.54/1M tokens.
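The blended figure above appears consistent with a weighted average of the input and output prices. A minimal sketch, assuming a 3:1 input-to-output token ratio (the page does not state the weighting; the function name and the ratio are illustrative):

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption, not something the page states.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices: $0.20 input, $1.25 output (per 1M tokens).
price = blended_price(0.20, 1.25)
print(f"${price:.4f} per 1M tokens")
```

Under that assumption the result is about $0.4625, close to the displayed $0.46. A different weighting would give a different blended price.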
Cost Calculator
Tokens per day
1M (range: 100K to 100M)
Daily
$0.46
Monthly
$13.89
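The calculator's arithmetic can be sketched as tokens per day times the blended price per 1M tokens. This is a rough sketch: the function name is hypothetical, and the 30-day month is an assumption. At the rounded $0.46 price it gives $13.80 per month rather than the displayed $13.89, which suggests the page uses an unrounded price slightly above $0.46.

```python
def projected_cost(tokens_per_day, price_per_million, days=1):
    """Estimated spend in dollars for a given daily token volume."""
    return tokens_per_day / 1_000_000 * price_per_million * days


daily = projected_cost(1_000_000, 0.46)             # 1M tokens per day
monthly = projected_cost(1_000_000, 0.46, days=30)  # assumes a 30-day month

print(f"Daily:   ${daily:.2f}")
print(f"Monthly: ${monthly:.2f}")
```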
vs. Similar Models
MiniMax M1 80k (Q: +0.1)
$0.96 (+108%)
gpt-oss-120b (low) (Q: +0.1)
$0.26 (-43%)
Nova 2.0 Lite (low) (Q: +0.2)
$0.85 (+84%)
Perplexity: Sonar Reasoning Pro (Q: +0.2)
$3.50 (+656%)
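The percentages in this comparison look like each model's blended price relative to this model's $0.46. A small sketch of that calculation, assuming that is how the page derives them (the function name is illustrative); results from the rounded $0.46 baseline can differ from the displayed figures by a few points:

```python
def price_difference_pct(other_price, baseline_price):
    """Percent by which other_price exceeds (or undercuts) baseline_price."""
    return (other_price / baseline_price - 1) * 100


baseline = 0.46  # this model's displayed blended price per 1M tokens
similar_models = {
    "MiniMax M1 80k": 0.96,
    "gpt-oss-120b (low)": 0.26,
    "Nova 2.0 Lite (low)": 0.85,
    "Perplexity: Sonar Reasoning Pro": 3.50,
}

for name, price in similar_models.items():
    print(f"{name}: {price_difference_pct(price, baseline):+.1f}%")
```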
Performance
161
tokens/sec
Faster than 78% of models
0.47
seconds
Faster than 89% of models
0.47
seconds
Faster than 94% of models
Market Median
94 tok/s
71% faster
Median TTFT
1.10s
58% faster
Throughput/Dollar
347
tok/s per $/1M
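The relative figures in this section can be approximated from the displayed values. The sketch below assumes "faster" means higher throughput or lower latency relative to the median, and that throughput per dollar is tokens/sec divided by the blended price. The function names are illustrative, and because the inputs are rounded the results only roughly match the page (for example, about 350 rather than 347).

```python
def throughput_gain_pct(model_tps, median_tps):
    """Percent by which the model's tokens/sec exceeds the median."""
    return (model_tps / median_tps - 1) * 100


def latency_reduction_pct(model_seconds, median_seconds):
    """Percent by which the model's latency is below the median."""
    return (median_seconds - model_seconds) / median_seconds * 100


def throughput_per_dollar(model_tps, price_per_million):
    """Tokens/sec per dollar of blended price per 1M tokens."""
    return model_tps / price_per_million


print(f"{throughput_gain_pct(161, 94):.1f}% faster than median throughput")
print(f"{latency_reduction_pct(0.47, 1.10):.1f}% lower than median TTFT")
print(f"{throughput_per_dollar(161, 0.46):.0f} tok/s per $/1M")
```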
Speed Comparison
OpenAI: GPT-5 Nano
160 tok/s (-0%)
Gemini 3 Pro Preview (high)
161 tok/s (+1%)
GLM-4.7-Flash (Non-reasoning)
162 tok/s (+1%)
Benchmarks
MMLU-Pro
Not evaluated
GPQA Diamond
55.8%
HLE
4.2%
LiveCodeBench
Not evaluated
SciCode
35.2%
TerminalBench Hard
24.2%
MATH-500
Not evaluated
AIME
Not evaluated
AIME 2025
Not evaluated
IFBench
32.7%
Long Context Recall
24.7%
Tau2
34.8%