About
Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...
Related Models
Pricing
Input
$0.15
per 1M tokens
Output
$0.60
per 1M tokens
Blended
$0.26
per 1M tokens
Cheaper than 64% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.26
Monthly
$7.87
vs. Similar Models
Performance
100
tokens/sec
Faster than 53% of models
0.67
seconds
Faster than 73% of models
0.67
seconds
Faster than 81% of models
Market Median
94 tok/s
6% faster
Median TTFT
1.11s
40% faster
Throughput/Dollar
383
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 90% of models
Max Output
16K
tokens
2% of context