Related Models
Pricing
Input
$0.10
per 1M tokens
Output
$0.20
per 1M tokens
Blended
$0.13
per 1M tokens
Cheaper than 80% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.13
Monthly
$3.75
vs. Similar Models
Qwen3 0.6B (Non-reasoning)Q:0.0
$0.19+50%
Gemma 3 4B InstructQ:+0.1
$0.05-60%
Llama 3.2 Instruct 1BQ:+0.1
$0.05-60%
Gemma 3n E4B InstructQ:+0.2
$0.03-80%
Performance
148
tokens/sec
Faster than 73% of models
1.88
seconds
Faster than 24% of models
1.88
seconds
Faster than 51% of models
Market Median
94 tok/s
58% faster
Median TTFT
1.10s
70% slower
Throughput/Dollar
1186
tok/s per $/1M
Speed Comparison
Qwen3.5 122B A10B
148 tok/s+0%
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
148 tok/s-0%
GPT-5 nano (minimal)
149 tok/s+0%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
25.6%
HLE
5.0%
LiveCodeBenchNot evaluated
SciCode
4.1%
TerminalBench Hard
0.0%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
22.4%
Long Context Recall
0.0%
Tau2
11.4%
Market AverageTop Score