Related Models
Pricing
Input
$0.08
per 1M tokens
Output
$0.29
per 1M tokens
Blended
$0.13
per 1M tokens
Cheaper than 79% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.13
Monthly
$3.99
vs. Similar Models
Llama 3.1 Instruct 70BQ:0.0
$0.56+321%
Ministral 3 3BQ:0.0
$0.10-25%
Qwen3 4B (Non-reasoning)Q:0.0
$0.19+41%
IBM: Granite 4.1 8BQ:-0.1
$0.06-53%
Performance
112
tokens/sec
Faster than 60% of models
1.00
seconds
Faster than 57% of models
1.00
seconds
Faster than 70% of models
Market Median
94 tok/s
20% faster
Median TTFT
1.10s
9% faster
Throughput/Dollar
846
tok/s per $/1M
Speed Comparison
DeepSeek V4 Flash
112 tok/s-0%
GPT-5.4 (low)
111 tok/s-1%
Qwen3 30B A3B (Reasoning)
114 tok/s+1%
Benchmarks
MMLU-Pro
71.0%
GPQA Diamond
51.5%
HLE
4.6%
LiveCodeBench
32.2%
SciCode
26.4%
TerminalBench Hard
6.8%
MATH-500
86.3%
AIME
26.0%
AIME 2025
21.7%
IFBench
31.9%
Long Context Recall
0.0%
Tau2
22.2%
Market AverageTop Score