Related Models
Pricing
Input
$0.38
per 1M tokens
Output
$2.25
per 1M tokens
Blended
$0.84
per 1M tokens
Cheaper than 41% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.84
Monthly
$25.32
vs. Similar Models
Qwen: Qwen3 MaxQ:-0.2
$1.56+85%
Arcee AI: Trinity Large ThinkingQ:+0.3
$0.39-54%
OpenAI: gpt-oss-120bQ:-0.4
$0.06-93%
DeepSeek: DeepSeek V3.2Q:+0.5
$0.26-70%
Performance
143
tokens/sec
Faster than 71% of models
1.30
seconds
Faster than 40% of models
1.30
seconds
Faster than 61% of models
Market Median
94 tok/s
53% faster
Median TTFT
1.10s
18% slower
Throughput/Dollar
169
tok/s per $/1M
Speed Comparison
Grok 4.3 (medium)
143 tok/s-0%
Qwen: Qwen3 VL 8B Instruct
143 tok/s-0%
Sarvam M (Reasoning)
143 tok/s-0%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
81.7%
HLE
12.5%
LiveCodeBenchNot evaluated
SciCode
1.3%
TerminalBench Hard
25.8%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
36.2%
Long Context Recall
56.7%
Tau2
85.1%
Market AverageTop Score