Related Models
Pricing
Input
$0.10
per 1M tokens
Output
$0.20
per 1M tokens
Blended
$0.13
per 1M tokens
Cheaper than 80% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.13
Monthly
$3.75
vs. Similar Models
Qwen3 0.6B (Non-reasoning)Q:0.0
$0.19+50%
Gemma 3 4B InstructQ:+0.1
$0.05-60%
Llama 3.2 Instruct 1BQ:+0.1
$0.05-60%
Gemma 3n E4B InstructQ:+0.2
$0.03-80%
Performance
148
tokens/sec
Faster than 73% of models
1.88
seconds
Faster than 24% of models
1.88
seconds
Faster than 51% of models
Market Median
94 tok/s
57% faster
Median TTFT
1.11s
68% slower
Throughput/Dollar
1186
tok/s per $/1M
Speed Comparison
Mistral Small 4 (Non-reasoning)
149 tok/s+0%
GLM-4.7-Flash (Non-reasoning)
148 tok/s-0%
Qwen3.6 35B A3B (Non-reasoning)
149 tok/s+0%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
25.6%
HLE
5.0%
LiveCodeBenchNot evaluated
SciCode
4.1%
TerminalBench Hard
0.0%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
22.4%
Long Context Recall
0.0%
Tau2
11.4%
Market AverageTop Score