Pricing
Input
$0.20
per 1M tokens
Output
$0.80
per 1M tokens
Blended
$0.35
per 1M tokens
Cheaper than 59% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.35
Monthly
$10.50
vs. Similar Models
Qwen2.5 TurboQ:0.0
$0.09-75%
Llama 3.2 Instruct 90B (Vision)Q:-0.1
$1.38+294%
Solar MiniQ:-0.1
$0.15-57%
AllenAI: Olmo 3 32B ThinkQ:+0.1
$0.24-32%
Performance
85
tokens/sec
Faster than 44% of models
1.72
seconds
Faster than 28% of models
1.72
seconds
Faster than 53% of models
Market Median
94 tok/s
9% slower
Median TTFT
1.10s
55% slower
Throughput/Dollar
243
tok/s per $/1M
Speed Comparison
Mistral: Mistral Medium 3.1
85 tok/s+0%
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
85 tok/s+0%
OpenAI: GPT-5.2
85 tok/s+0%
Benchmarks
MMLU-ProNot evaluated
GPQA DiamondNot evaluated
HLENot evaluated
LiveCodeBenchNot evaluated
SciCodeNot evaluated
TerminalBench HardNot evaluated
MATH-500
52.9%
AIMENot evaluated
AIME 2025Not evaluated
IFBenchNot evaluated
Long Context RecallNot evaluated
Tau2Not evaluated
Market AverageTop Score