Related Models
Pricing
Input
$0.12
per 1M tokens
Output
$0.43
per 1M tokens
Blended
$0.20
per 1M tokens
Cheaper than 69% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.20
Monthly
$6.00
vs. Similar Models
MiniMax: MiniMax M2.5Q:+0.1
$0.21+5%
Qwen: Qwen3.5 397B A17BQ:+0.1
$0.90+351%
Claude 4.1 Opus (Reasoning)Q:+0.1
$30.00+14900%
GPT-5 (medium)Q:+0.1
$3.44+1619%
Performance
154
tokens/sec
Faster than 76% of models
1.80
seconds
Faster than 26% of models
14.83
seconds
Faster than 33% of models
Market Median
94 tok/s
63% faster
Median TTFT
1.11s
62% slower
Throughput/Dollar
768
tok/s per $/1M
Speed Comparison
Mistral Small (Sep '24)
153 tok/s-0%
Sarvam 105B (high)
153 tok/s-1%
Mistral Small 3.2
152 tok/s-1%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
86.7%
HLE
25.5%
LiveCodeBenchNot evaluated
SciCode
41.2%
TerminalBench Hard
34.1%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
63.1%
Long Context Recall
54.7%
Tau2
92.7%
Market AverageTop Score