Pricing
Input
$0.03
per 1M tokens
Output
$0.11
per 1M tokens
Blended
$0.05
per 1M tokens
Cheaper than 90% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.05
Monthly
$1.41
vs. Similar Models
Olmo 3.1 32B InstructQ:-0.1
$0.30+538%
IBM: Granite 4.1 8BQ:+0.1
$0.06+33%
AllenAI: Olmo 3 32B ThinkQ:-0.2
$0.24+405%
Mistral: SabaQ:-0.2
$0.30+538%
Performance
244
tokens/sec
Faster than 94% of models
1.14
seconds
Faster than 47% of models
9.35
seconds
Faster than 41% of models
Market Median
94 tok/s
160% faster
Median TTFT
1.10s
3% slower
Throughput/Dollar
5183
tok/s per $/1M
Speed Comparison
Qwen3.5 Omni Flash
243 tok/s-0%
Grok 4.20 0309 (Reasoning)
240 tok/s-1%
OpenAI: gpt-oss-20b
238 tok/s-2%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
63.3%
HLE
7.0%
LiveCodeBenchNot evaluated
SciCode
19.2%
TerminalBench Hard
2.3%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
26.5%
Long Context Recall
0.0%
Tau2
34.5%
Market AverageTop Score