Pricing
Input: $0.08 per 1M tokens
Output: $0.29 per 1M tokens
Blended: $0.13 per 1M tokens
Cheaper than 79% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day: 1M (range: 100K to 100M)
Daily: $0.13
Monthly: $3.99
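The calculator's arithmetic can be sketched as follows. This is a minimal illustration, not the page's actual implementation: the function names and the 30-day month are assumptions. Using the rounded blended price of $0.13 gives $3.90 per month, so the displayed $3.99 presumably comes from an unrounded price or a different month length.

```python
def daily_cost(tokens_per_day: float, price_per_million: float) -> float:
    """Dollar cost of one day's tokens at a per-1M-token price."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(tokens_per_day: float, price_per_million: float,
                 days_per_month: int = 30) -> float:
    """Dollar cost per month; the 30-day month is an assumption."""
    return daily_cost(tokens_per_day, price_per_million) * days_per_month


BLENDED_PRICE = 0.13  # rounded blended price shown on the page, $ per 1M tokens

daily = daily_cost(1_000_000, BLENDED_PRICE)
monthly = monthly_cost(1_000_000, BLENDED_PRICE)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")
```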
vs. Similar Models
Llama 3.1 Instruct 70B (Q: 0.0): $0.56, +321%
Ministral 3 3B (Q: 0.0): $0.10, -25%
Qwen3 4B (Non-reasoning) (Q: 0.0): $0.19, +41%
IBM: Granite 4.1 8B (Q: -0.1): $0.06, -53%
Performance
111 tokens/sec (faster than 60% of models)
1.02 seconds (faster than 55% of models)
1.02 seconds (faster than 68% of models)
Market Median: 94 tok/s (this model is 18% faster)
Median TTFT: 1.11s (this model is 8% faster)
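The two "faster" percentages are consistent with a simple relative difference against the median, as the sketch below shows. The function names are hypothetical, and treating the 1.02-second figure as this model's TTFT is an assumption based on the median TTFT comparison.

```python
def percent_faster_throughput(model_tps: float, median_tps: float) -> float:
    """Relative speed advantage when higher is better (tokens/sec)."""
    return (model_tps - median_tps) / median_tps * 100


def percent_faster_latency(model_s: float, median_s: float) -> float:
    """Relative speed advantage when lower is better (seconds)."""
    return (median_s - model_s) / median_s * 100


speed_gain = percent_faster_throughput(111, 94)   # about 18.1
ttft_gain = percent_faster_latency(1.02, 1.11)    # about 8.1
print(f"{speed_gain:.0f}% faster throughput, {ttft_gain:.0f}% faster TTFT")
```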
Throughput/Dollar: 838 tok/s per $/1M
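One way to arrive at the throughput-per-dollar figure is to divide output speed by the blended price. The page does not say how the blended price is computed; the 3:1 input-to-output weighting below is an assumption that happens to round to $0.13 and reproduce the 838 figure.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens; the 3:1 weighting is an assumption."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens per second delivered per dollar of per-1M-token price."""
    return tokens_per_sec / price_per_million


price = blended_price(0.08, 0.29)          # about 0.1325
ratio = throughput_per_dollar(111, price)  # about 837.7
print(f"Blended: ${price:.4f}, Throughput/Dollar: {ratio:.0f}")
```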
Speed Comparison
Claude 4.5 Haiku (Reasoning): 111 tok/s, -0%
LFM2 24B A2B: 111 tok/s, -0%
GPT-5.4 (Non-reasoning): 111 tok/s, -1%
Benchmarks
MMLU-Pro: 71.0%
GPQA Diamond: 51.5%
HLE: 4.6%
LiveCodeBench: 32.2%
SciCode: 26.4%
TerminalBench Hard: 6.8%
MATH-500: 86.3%
AIME: 26.0%
AIME 2025: 21.7%
IFBench: 31.9%
Long Context Recall: 0.0%
Tau2: 22.2%