Related Models
Pricing
Input
$0.06
per 1M tokens
Output
$0.20
per 1M tokens
Blended
$0.10
per 1M tokens
Cheaper than 84% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.10
Monthly
$2.85
vs. Similar Models
Qwen: Qwen3 VL 235B A22B InstructQ:0.0
$0.37+289%
GPT-5 mini (minimal)Q:0.0
$0.69+624%
Meta: Llama 4 MaverickQ:0.0
$0.26+176%
Nova 2.0 Pro Preview (Non-reasoning)Q:+0.1
$3.44+3519%
Performance
265
tokens/sec
Faster than 94% of models
0.62
seconds
Faster than 75% of models
8.18
seconds
Faster than 43% of models
Market Median
94 tok/s
180% faster
Median TTFT
1.11s
44% faster
Throughput/Dollar
2785
tok/s per $/1M
Speed Comparison
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
266 tok/s+0%
Gemini 2.5 Flash-Lite (Reasoning)
269 tok/s+2%
Sarvam 30B (high)
243 tok/s-8%
Benchmarks
MMLU-Pro
71.8%
GPQA Diamond
61.1%
HLE
5.1%
LiveCodeBench
65.2%
SciCode
34.0%
TerminalBench Hard
4.5%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
62.3%
IFBench
57.8%
Long Context Recall
31.0%
Tau2
50.3%
Market AverageTop Score