Related Models
Pricing
Input
$1.20
per 1M tokens
Output
$6.00
per 1M tokens
Blended
$2.40
per 1M tokens
Cheaper than 23% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$2.40
Monthly
$72.00
vs. Similar Models
Qwen: Qwen3.5-9BQ:0.0
$0.11-95%
Google: Gemini 3.1 Flash Lite PreviewQ:0.0
$0.56-77%
GLM-4.6 (Reasoning)Q:+0.1
$0.96-60%
Gemma 4 31B (Non-reasoning)Q:-0.2
$0.20-91%
Performance
52
tokens/sec
Faster than 20% of models
1.74
seconds
Faster than 28% of models
40.34
seconds
Faster than 10% of models
Market Median
95 tok/s
45% slower
Median TTFT
1.11s
57% slower
Throughput/Dollar
22
tok/s per $/1M
Speed Comparison
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
52 tok/s+0%
Qwen: Qwen3.5 397B A17B
52 tok/s+0%
Llama Nemotron Super 49B v1.5 (Reasoning)
52 tok/s-0%
Benchmarks
MMLU-Pro
82.4%
GPQA Diamond
77.6%
HLE
12.0%
LiveCodeBench
53.5%
SciCode
38.7%
TerminalBench Hard
17.4%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
82.3%
IFBench
53.8%
Long Context Recall
57.7%
Tau2
83.6%
Market AverageTop Score