Pricing
Input: $0.60 per 1M tokens
Output: $3.60 per 1M tokens
Blended: $1.35 per 1M tokens
Cheaper than 31% of models. Median price is $0.54/1M tokens.
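The blended figure is consistent with a weighted average of the input and output prices. The sketch below assumes a 3:1 input-to-output weighting; the page does not state the ratio, so the weights (and the function name `blended_price`) are assumptions chosen because they reproduce the listed $1.35.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption, not something the page states.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices: $0.60 input and $3.60 output per 1M tokens.
blended = blended_price(0.60, 3.60)
print(f"${blended:.2f} per 1M tokens")
```

With a different traffic mix (for example, output-heavy workloads), passing other weights gives a blended price that better reflects actual spend.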
Cost Calculator
Tokens per day: 1M (adjustable from 100K to 100M)
Daily: $1.35
Monthly: $40.50
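The calculator figures can be reproduced with simple arithmetic. This is a minimal sketch that assumes the blended price of $1.35 per 1M tokens is applied to all tokens and that a month is counted as 30 days; both assumptions match the listed totals, but the page does not confirm them. The function names are illustrative.

```python
def daily_cost(tokens_per_day: float, price_per_million: float) -> float:
    """Cost in dollars for one day of usage at a per-1M-token price."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(tokens_per_day: float, price_per_million: float,
                 days_per_month: int = 30) -> float:
    """Cost in dollars for a month; the 30-day month is an assumption."""
    return daily_cost(tokens_per_day, price_per_million) * days_per_month


BLENDED_PRICE = 1.35  # dollars per 1M tokens, as listed

print(f"Daily:   ${daily_cost(1_000_000, BLENDED_PRICE):.2f}")
print(f"Monthly: ${monthly_cost(1_000_000, BLENDED_PRICE):.2f}")
```

The same functions cover the rest of the slider range, for example `monthly_cost(100_000_000, BLENDED_PRICE)` for the 100M tokens per day upper bound.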
vs. Similar Models
Qwen: Qwen3.5-122B-A10B (Q: +0.3): $0.72 (-47%)
Qwen: Qwen3 Max Thinking (Q: -0.3): $1.56 (+16%)
GLM-5 (Non-reasoning) (Q: +0.4): $1.55 (+15%)
Qwen: Qwen3.6 35B A3B (Q: -0.4): $0.35 (-74%)
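The percentages in this comparison appear to be each model's price relative to this model's blended price of $1.35 per 1M tokens. The sketch below assumes that baseline (the page does not say which price is used) and rounds to the nearest whole percent; `percent_diff` is an illustrative helper.

```python
def percent_diff(price: float, reference: float) -> float:
    """Signed percentage difference of `price` relative to `reference`."""
    return (price - reference) / reference * 100


REFERENCE_PRICE = 1.35  # assumed baseline: this model's blended price per 1M tokens

similar_models = {
    "Qwen: Qwen3.5-122B-A10B": 0.72,
    "Qwen: Qwen3 Max Thinking": 1.56,
    "GLM-5 (Non-reasoning)": 1.55,
    "Qwen: Qwen3.6 35B A3B": 0.35,
}

for name, price in similar_models.items():
    diff = round(percent_diff(price, REFERENCE_PRICE))
    print(f"{name}: ${price:.2f} ({diff:+d}%)")
```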
Performance
52 tokens/sec (faster than 20% of models)
1.72 seconds (faster than 28% of models)
1.72 seconds (faster than 53% of models)
Market Median: 94 tok/s (this model is 45% slower)
Median TTFT: 1.10s (this model is 56% slower)
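Both "slower" percentages are consistent with measuring the gap to the median as a fraction of the median. The sketch below assumes that formula and that the 1.72-second figure is this model's time to first token; neither is stated on the page, and `gap_vs_median` is an illustrative name.

```python
def gap_vs_median(value: float, median: float) -> float:
    """Absolute gap between a value and the median, as a percentage of the median."""
    return abs(value - median) / median * 100


# Throughput: 52 tok/s against a market median of 94 tok/s (lower is slower).
throughput_gap = gap_vs_median(52, 94)

# Time to first token: 1.72 s against a median of 1.10 s (higher is slower).
ttft_gap = gap_vs_median(1.72, 1.10)

print(f"Throughput: {throughput_gap:.0f}% slower than median")
print(f"TTFT:       {ttft_gap:.0f}% slower than median")
```

The direction of "slower" differs between the two metrics (fewer tokens per second versus more seconds of latency), which is why the helper takes an absolute difference.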
Throughput/Dollar: 38 tok/s per $/1M
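The throughput-per-dollar figure looks like output speed divided by the blended price. A minimal sketch under that assumption follows; with the rounded values shown on the page (52 tok/s and $1.35) the ratio is about 38.5, so the listed 38 may reflect truncation or unrounded inputs.

```python
def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens per second delivered per dollar of blended price (per 1M tokens)."""
    if price_per_million <= 0:
        raise ValueError("price_per_million must be positive")
    return tokens_per_sec / price_per_million


# Assumed inputs: 52 tok/s and the blended price of $1.35 per 1M tokens.
value = throughput_per_dollar(52, 1.35)
print(f"{value:.1f} tok/s per $/1M")
```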
Speed Comparison
Llama 3.2 Instruct 3B: 52 tok/s (+0%)
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning): 52 tok/s (-0%)
Claude 4.5 Sonnet (Non-reasoning): 52 tok/s (-0%)
Benchmarks
MMLU-Pro: Not evaluated
GPQA Diamond: 86.1%
HLE: 18.8%
LiveCodeBench: Not evaluated
SciCode: 41.1%
TerminalBench Hard: 35.6%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: Not evaluated
IFBench: 51.6%
Long Context Recall: 58.0%
Tau2: 83.9%