Related Models
Pricing
Input
$0.02
per 1M tokens
Output
$0.10
per 1M tokens
Blended
$0.04
per 1M tokens
Cheaper than 91% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.04
Monthly
$1.20
vs. Similar Models
Qwen2.5 MaxQ:0.0
$2.80+6900%
Qwen3 14B (Reasoning)Q:-0.1
$0.73+1727%
Qwen3 VL 30B A3B InstructQ:-0.2
$0.23+469%
Hermes 4 - Llama-3.1 70B (Reasoning)Q:-0.2
$0.20+395%
Performance
36
tokens/sec
Faster than 5% of models
0.41
seconds
Faster than 95% of models
55.20
seconds
Faster than 4% of models
Market Median
94 tok/s
61% slower
Median TTFT
1.11s
64% faster
Throughput/Dollar
912
tok/s per $/1M
Speed Comparison
Claude 4.1 Opus (Non-reasoning)
37 tok/s+1%
Gemma 3 27B Instruct
36 tok/s-1%
Qwen3.5 2B (Non-reasoning)
37 tok/s+1%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
45.6%
HLE
2.1%
LiveCodeBenchNot evaluated
SciCode
2.8%
TerminalBench Hard
3.8%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
31.5%
Long Context Recall
23.7%
Tau2
69.0%
Market AverageTop Score