Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...
Input: $0.15 per 1M tokens
Output: $0.60 per 1M tokens
Blended: $0.26 per 1M tokens
Cheaper than 65% of models. Median price is $0.56/1M tokens.
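The page does not say how the blended price is derived. A minimal sketch, assuming a 3:1 weighting of input to output tokens (an assumption, not something the page states), reproduces the listed figure from the input and output prices. The function name and the weights are illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 input:output default is an assumption made for this sketch;
    the source page does not state its weighting.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Prices per 1M tokens as listed on the page.
blended = blended_price(0.15, 0.60)
print(f"Blended: ${blended:.4f} per 1M tokens")
```

Under this assumption the result is about $0.2625, which rounds to the $0.26 shown. A different workload mix (for example, output-heavy generation) would shift the effective price toward the $0.60 output rate.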
Daily: $0.26
Monthly: $7.87
112 tokens/sec (faster than 63% of models)
0.64 seconds (faster than 71% of models)
0.64 seconds (faster than 80% of models)
Market Median: 86 tok/s (31% faster)
Median TTFT: 1.07s (40% faster)
Throughput/Dollar: 428 tok/s per $/1M
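The comparisons against the market median and the throughput-per-dollar figure can be approximated from the rounded values shown on the page. The sketch below assumes that "faster" means higher throughput or lower time to first token relative to the median, and that throughput per dollar is tokens per second divided by the blended price; the helper names are hypothetical. Because the inputs are the rounded display values, the results land near the published figures rather than exactly on them.

```python
def percent_higher(value: float, median: float) -> float:
    """How much higher `value` is than `median`, in percent (for throughput)."""
    return (value / median - 1.0) * 100.0


def percent_lower(value: float, median: float) -> float:
    """How much lower `value` is than `median`, in percent (for latency)."""
    return (1.0 - value / median) * 100.0


def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens per second divided by the price per 1M tokens."""
    return tokens_per_sec / price_per_million


# Rounded values as displayed on the page.
speed_gain = percent_higher(112, 86)        # throughput vs. market median
ttft_gain = percent_lower(0.64, 1.07)       # TTFT vs. median TTFT
value_score = throughput_per_dollar(112, 0.26)

print(f"Throughput vs. median: {speed_gain:.1f}% higher")
print(f"TTFT vs. median: {ttft_gain:.1f}% lower")
print(f"Throughput per dollar: {value_score:.0f} tok/s per $/1M")
```

With these inputs the throughput gain comes out slightly above 30% and the throughput per dollar near 431, against the 31% and 428 listed. The small gaps are presumably due to the page computing from unrounded measurements, which is an assumption here.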
Context Window: 1.0M tokens (larger than 91% of models)
Max Output: 16K tokens (2% of context)
Quality Index: 18.4 (271st of 507, top 53%)
Coding Index: 15.6 (241st of 417, top 58%)
Math Index: 19.3 (211th of 269, top 78%)
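The "Top N%" labels on the index rankings appear to follow directly from rank divided by the number of ranked models, rounded to a whole percent. The sketch below checks that reading against the three rankings listed; the function name is illustrative, and the rounding rule is an assumption.

```python
def top_percent(rank: int, total: int) -> int:
    """Rank expressed as a whole-number percentage of the ranked field."""
    return round(rank / total * 100)


# (rank, number of ranked models) as listed on the page.
rankings = {
    "Quality Index": (271, 507),
    "Coding Index": (241, 417),
    "Math Index": (211, 269),
}

percentiles = {name: top_percent(rank, total) for name, (rank, total) in rankings.items()}
for name, pct in percentiles.items():
    print(f"{name}: top {pct}%")
```

Note that a larger "Top N%" value means a lower standing: top 78% on the Math Index places the model in the bottom quarter of the 269 ranked models.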
Price/1M: $0.26 (218th cheapest, 53% below median, top 35%)
Speed: 112 tok/s (top 37%)
TTFT: 0.64s
Context Window: 1.0M (15th largest, top 9%)