NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA · Released 2024-10-15

Open Source · 131K ctx

About

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed to generate precise and useful responses. Leveraging the [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

Price/1M

$1.20

458th cheapest

78% above median

Top 64%

Context Window

131K

250th largest

Top 73%

Pricing

Input

$1.20

per 1M tokens

Output

$1.20

per 1M tokens

Blended

$1.20

per 1M tokens

Cheaper than 36% of models. Median price is $0.67/1M tokens.

Cost Calculator

Tokens per day: 1M (adjustable from 100K to 100M)

Daily

$1.20

Monthly

$36.00
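The daily and monthly figures above follow from simple arithmetic on the $1.20 per 1M token price. A minimal sketch, assuming a flat price for input and output tokens and a 30-day month (the page does not state the month length; 30 is inferred from $36.00 / $1.20). The function name `estimate_cost` is hypothetical.

```python
def estimate_cost(tokens_per_day: int,
                  price_per_million: float = 1.20,
                  days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in USD, rounded to cents.

    Assumes a single flat price per 1M tokens and a 30-day month;
    both are illustrative assumptions, not documented billing rules.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    monthly = daily * days_per_month
    return round(daily, 2), round(monthly, 2)


daily, monthly = estimate_cost(1_000_000)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")
```

Passing a different `tokens_per_day` value covers the rest of the 100K to 100M range offered by the calculator.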

vs. Similar Models

Kimi K2.5 (Non-reasoning)

$1.20 (0%)

Llama 3.1 Nemotron Instruct 70B

$1.20 (0%)

DeepSeek V3 0324

$1.21 (+1%)

Baidu: Qianfan-OCR-Fast

$1.21 (+1%)

Performance

Context Window

131K

tokens

Larger than 27% of models

Max Output

16K

tokens

13% of context

Context Window Comparison

DeepSeek: DeepSeek V3.2

131K (Same)

OpenAI: gpt-oss-120b

131K (Same)

MoonshotAI: Kimi K2 0711

131K (Same)

Open Source

Quick Compare

Similar Models

Nemotron 3 Ultra 550B A55B (Reasoning)

NVIDIA

Q: 37.8 · $1.18/1M

NVIDIA Nemotron 3 Super 120B A12B (Reasoning)

NVIDIA

Q: 25.4 · $0.38/1M

Cheaper: 68%

Nemotron Cascade 2 30B A3B

NVIDIA

Q: 21.3 · Price N/A

Nemotron 3 Nano Omni 30B A3B Reasoning

NVIDIA

Q: 14.9 · $0.13/1M

Cheaper: 89%

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)

NVIDIA

Q: 14.2 · $0.09/1M

Cheaper: 93%

Llama Nemotron Super 49B v1.5 (Reasoning)

NVIDIA

Q: 12.4 · $0.40/1M

Cheaper: 67%

Cheaper: 67%
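The "Cheaper" percentages above appear to be the relative saving against this model's $1.20/1M price. The page does not state the formula, so the following is a sketch under that assumption, with half-up rounding chosen because it reproduces the listed figures. The function name `percent_cheaper` is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP


def percent_cheaper(other_price: str, reference_price: str = "1.20") -> int:
    """Whole-percent saving of other_price relative to reference_price.

    Prices are passed as strings so Decimal keeps them exact.
    The formula and the half-up rounding are assumptions inferred
    from the listed figures, not a documented method.
    """
    saving = (1 - Decimal(other_price) / Decimal(reference_price)) * 100
    return int(saving.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


for price in ("0.38", "0.13", "0.09", "0.40"):
    print(f"${price}/1M is {percent_cheaper(price)}% cheaper than $1.20/1M")
```

`Decimal` is used instead of floats because the built-in `round` rounds exact halves to the nearest even integer, which would turn the 92.5% saving for the $0.09 model into 92 rather than the 93 shown.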

