Pricing
Input: $0.25 per 1M tokens
Output: $2.00 per 1M tokens
Blended: $0.69 per 1M tokens
Cheaper than 46% of models. Median price is $0.54/1M tokens.
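
The page does not say how the blended price is derived. As a sketch under one assumption, a 3:1 input-to-output token weighting reproduces the displayed $0.69 from the input and output prices above. The weighting and the function name `blended_price` are illustrative and not taken from the page.

```python
# Prices from the Pricing section, in USD per 1M tokens.
INPUT_PRICE = 0.25
OUTPUT_PRICE = 2.00


def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for an input:output token mix.

    The 3:1 default is an assumption; the page does not state its weighting.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"${blended:.4f} per 1M tokens")  # $0.6875 per 1M tokens
```

Rounded to cents, $0.6875 gives the $0.69 shown above. A workload with a different input/output mix would pass its own ratio instead of the default.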
Cost Calculator
Tokens per day: 1M (range: 100K to 100M)
Daily: $0.69
Monthly: $20.64
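
The calculator's arithmetic can be sketched as follows, assuming cost scales linearly with token volume and a month is 30 days. Both are assumptions, and the function names are illustrative. The page's $20.64 monthly figure works out to about $0.688 per day over 30 days, so the page presumably uses an unrounded price. This sketch uses the displayed $0.69 and therefore gives a slightly higher monthly total.

```python
BLENDED_PRICE_PER_1M = 0.69  # USD per 1M tokens, as displayed on the page
DAYS_PER_MONTH = 30          # assumption; the page does not state its day count


def daily_cost(tokens_per_day: int,
               price_per_1m: float = BLENDED_PRICE_PER_1M) -> float:
    """Cost in USD for one day of traffic at the given blended price."""
    return tokens_per_day / 1_000_000 * price_per_1m


def monthly_cost(tokens_per_day: int,
                 price_per_1m: float = BLENDED_PRICE_PER_1M,
                 days: int = DAYS_PER_MONTH) -> float:
    """Cost in USD for `days` days of identical daily traffic."""
    return daily_cost(tokens_per_day, price_per_1m) * days


# The calculator's range endpoints and its default of 1M tokens per day.
for tokens in (100_000, 1_000_000, 100_000_000):
    print(f"{tokens:>11,} tokens/day: "
          f"${daily_cost(tokens):,.2f}/day, ${monthly_cost(tokens):,.2f}/month")
```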
vs. Similar Models
Qwen: Qwen3 VL 235B A22B Instruct, Q: 0.0, $0.37, -46%
Meta: Llama 4 Maverick, Q: 0.0, $0.26, -62%
gpt-oss-20B (low), Q: 0.0, $0.10, -86%
Nova 2.0 Pro Preview (Non-reasoning), Q: +0.1, $3.44, +400%
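
The percentages in this comparison appear to be each model's price relative to this model's blended price. A minimal sketch, assuming the displayed $0.69 as the baseline (the helper name `percent_difference` is illustrative):

```python
BASELINE_PRICE = 0.69  # this model's blended price, USD per 1M tokens

# Prices as listed in the comparison, USD per 1M tokens.
similar_models = {
    "Qwen: Qwen3 VL 235B A22B Instruct": 0.37,
    "Meta: Llama 4 Maverick": 0.26,
    "gpt-oss-20B (low)": 0.10,
    "Nova 2.0 Pro Preview (Non-reasoning)": 3.44,
}


def percent_difference(price: float, baseline: float) -> float:
    """Signed difference of `price` relative to `baseline`, in percent."""
    return (price - baseline) / baseline * 100


for name, price in similar_models.items():
    print(f"{name}: ${price:.2f} ({percent_difference(price, BASELINE_PRICE):+.0f}%)")
```

With the rounded baseline this reproduces -46%, -62%, and -86%. The last row comes out at +399% rather than the page's +400%, which suggests the page computes from an unrounded blended price.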
Performance
101 tokens/sec, faster than 54% of models
0.68 seconds, faster than 71% of models
0.68 seconds, faster than 80% of models
Market Median: 94 tok/s (7% faster)
Median TTFT: 1.11s (39% faster)
Throughput/Dollar: 147 tok/s per $/1M
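
The derived figures above can be checked with a short sketch. It assumes that "faster" means higher throughput relative to the median for tokens/sec and lower latency relative to the median for TTFT, and that throughput per dollar is tokens/sec divided by the blended price. These readings and the function names are assumptions, not definitions given by the page.

```python
THROUGHPUT_TOK_S = 101        # this model, tokens/sec
MEDIAN_THROUGHPUT_TOK_S = 94  # market median, tokens/sec
TTFT_S = 0.68                 # this model, seconds
MEDIAN_TTFT_S = 1.11          # market median TTFT, seconds
BLENDED_PRICE_PER_1M = 0.69   # USD per 1M tokens, as displayed


def percent_faster_throughput(value: float, median: float) -> float:
    """How much higher `value` is than `median`, in percent."""
    return (value - median) / median * 100


def percent_faster_latency(value: float, median: float) -> float:
    """How much lower `value` is than `median`, in percent."""
    return (median - value) / median * 100


def throughput_per_dollar(tok_per_s: float, price_per_1m: float) -> float:
    """Tokens/sec obtained per USD of blended price per 1M tokens."""
    return tok_per_s / price_per_1m


print(f"{percent_faster_throughput(THROUGHPUT_TOK_S, MEDIAN_THROUGHPUT_TOK_S):.0f}% faster throughput")
print(f"{percent_faster_latency(TTFT_S, MEDIAN_TTFT_S):.0f}% faster TTFT")
print(f"{throughput_per_dollar(THROUGHPUT_TOK_S, BLENDED_PRICE_PER_1M):.0f} tok/s per $/1M")
```

The first two results match the 7% and 39% shown above. The last comes out at about 146 with the rounded $0.69, close to the page's 147, which presumably uses an unrounded price.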
Speed Comparison
GPT-5.1 (Non-reasoning): 101 tok/s (+0%)
OpenAI: GPT-4.1 Mini: 101 tok/s (+0%)
Meta: Llama 4 Maverick: 100 tok/s (-1%)
Benchmarks
MMLU-Pro: 77.5%
GPQA Diamond: 68.7%
HLE: 5.0%
LiveCodeBench: 54.5%
SciCode: 36.9%
TerminalBench Hard: 14.4%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: 46.7%
IFBench: 45.6%
Long Context Recall: 35.7%
Tau2: 31.9%