Loading...
Loading...
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...
Input
$1.20
per 1M tokens
Output
$1.20
per 1M tokens
Blended
$1.20
per 1M tokens
Cheaper than 33% of models. Median price is $0.54/1M tokens.
Daily
$1.20
Monthly
$36.00
Context Window
131K
tokens
Larger than 28% of models
Max Output
16K
tokens
13% of context
Context Window Comparison
Price/1M
$1.20
439th cheapest
124% above median
Top 67%
Context Window
131K
219th largest
Top 72%