About
Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...
Related Models
Pricing
Input
$0.10
per 1M tokens
Output
$0.30
per 1M tokens
Blended
$0.15
per 1M tokens
Cheaper than 75% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.15
Monthly
$4.50
vs. Similar Models
Performance
109
tokens/sec
Faster than 58% of models
0.61
seconds
Faster than 76% of models
0.61
seconds
Faster than 83% of models
Market Median
94 tok/s
16% faster
Median TTFT
1.11s
45% faster
Throughput/Dollar
730
tok/s per $/1M
Speed Comparison
Context Window
10.0M
tokens
Larger than 100% of models
Max Output
16K
tokens
0% of context