Related Models
Pricing
Input
$0.11
per 1M tokens
Output
$0.25
per 1M tokens
Blended
$0.14
per 1M tokens
Cheaper than 77% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.14
Monthly
$4.35
vs. Similar Models
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)Q:0.0
$0.09-39%
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)Q:0.0
$0.09-41%
Qwen3 8B (Reasoning)Q:0.0
$0.37+155%
Mistral Large 2407Q:-0.1
$3.00+1969%
Performance
36
tokens/sec
Faster than 5% of models
0.66
seconds
Faster than 73% of models
0.66
seconds
Faster than 81% of models
Market Median
94 tok/s
62% slower
Median TTFT
1.11s
41% faster
Throughput/Dollar
250
tok/s per $/1M
Speed Comparison
Qwen3.5 2B
36 tok/s+1%
Claude 4.1 Opus (Non-reasoning)
37 tok/s+1%
Qwen3.5 2B (Non-reasoning)
37 tok/s+2%
Benchmarks
MMLU-Pro
66.9%
GPQA Diamond
42.8%
HLE
4.7%
LiveCodeBench
13.7%
SciCode
21.2%
TerminalBench Hard
3.8%
MATH-500
88.3%
AIME
25.3%
AIME 2025
20.7%
IFBench
31.8%
Long Context Recall
5.7%
Tau2
10.5%
Market AverageTop Score