A 7.3B parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for speed and context length.
Input: $0.11 per 1M tokens
Output: $0.19 per 1M tokens
Blended: $0.13 per 1M tokens
Cheaper than 80% of models. Median price is $0.54/1M tokens.
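The page does not say how the blended price is derived. As a rough sketch, assuming a 3:1 input-to-output token weighting (an assumption, not something stated here), the listed input and output prices give the listed blended figure, and comparing that figure with the median gives the "76% below median" value shown further down. The names below are illustrative only.

```python
# Listed prices in USD per 1M tokens (taken from this page).
INPUT_PRICE = 0.11
OUTPUT_PRICE = 0.19
MEDIAN_PRICE = 0.54

# ASSUMPTION: blended price weights input and output tokens 3:1.
# The page does not state the ratio; this one happens to match.
INPUT_WEIGHT = 3
OUTPUT_WEIGHT = 1


def blended_price(input_price: float, output_price: float) -> float:
    """Weighted average price per 1M tokens under the assumed 3:1 mix."""
    total_weight = INPUT_WEIGHT + OUTPUT_WEIGHT
    return (INPUT_WEIGHT * input_price + OUTPUT_WEIGHT * output_price) / total_weight


def percent_below(price: float, reference: float) -> float:
    """How far `price` sits below `reference`, as a percentage."""
    return (reference - price) / reference * 100


blended = round(blended_price(INPUT_PRICE, OUTPUT_PRICE), 2)
below_median = round(percent_below(blended, MEDIAN_PRICE))

print(f"Blended: ${blended:.2f} per 1M tokens")
print(f"{below_median}% below median")
```

A different traffic mix (for example, output-heavy generation) would shift the effective price toward the $0.19 output rate.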
Daily: $0.13
Monthly: $3.90
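The daily figure equals the blended price for 1M tokens, and the monthly figure is 30 times the daily one. The sketch below estimates cost from the listed per-token prices; the workload (750K input and 250K output tokens per day) and the 30-day month are assumptions chosen to match the figures above, not values stated on the page.

```python
# Listed prices in USD per 1M tokens (taken from this page).
INPUT_PRICE_PER_M = 0.11
OUTPUT_PRICE_PER_M = 0.19

# ASSUMPTIONS: hypothetical daily workload and month length.
DAILY_INPUT_TOKENS = 750_000
DAILY_OUTPUT_TOKENS = 250_000
DAYS_PER_MONTH = 30


def usage_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for a given number of input and output tokens."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


daily_cost = usage_cost(DAILY_INPUT_TOKENS, DAILY_OUTPUT_TOKENS)
monthly_cost = daily_cost * DAYS_PER_MONTH

print(f"Daily: ${daily_cost:.2f}")
print(f"Monthly: ${monthly_cost:.2f}")
```

Substituting real token counts for the two workload constants gives an estimate for an actual deployment.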
Context Window: 4K tokens (larger than 0% of models)
Max Output: 3K tokens (69% of context)
Context Window Comparison
Price/1M: $0.13 (128th cheapest, 76% below median, top 20%)
Context Window: 4K (420th largest)