NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

NVIDIA·Released 2025-04-08

Open Source · 131K ctx

About

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

Price/1M: $0.90 (420th cheapest, 66% above median, top 62%)

Context Window: 131K (236th largest, top 73%)

Pricing

Input: $0.60 per 1M tokens

Output: $1.80 per 1M tokens

Blended: $0.90 per 1M tokens

Cheaper than 38% of models. Median price is $0.54/1M tokens.
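The page does not say how the blended figure is derived. As a minimal sketch, a 3:1 input-to-output token weighting happens to reproduce the listed $0.90 from the $0.60 and $1.80 rates. The weighting and the `blended_price` helper are assumptions made for illustration, not something the listing documents.

```python
from decimal import Decimal

def blended_price(input_price: Decimal, output_price: Decimal,
                  input_weight: int = 3, output_weight: int = 1) -> Decimal:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption: it matches the listed
    blended price, but the page itself does not state the formula.
    """
    total_weight = input_weight + output_weight
    weighted = input_price * input_weight + output_price * output_weight
    return (weighted / total_weight).quantize(Decimal("0.01"))

# Listed rates: $0.60 input, $1.80 output per 1M tokens.
print(blended_price(Decimal("0.60"), Decimal("1.80")))  # 0.90
```

A workload with a different input/output mix can pass its own weights, for example `blended_price(Decimal("0.60"), Decimal("1.80"), 1, 1)` for an even split.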

Cost Calculator

Tokens per day: 1M (adjustable from 100K to 100M)

Daily: $0.90

Monthly: $27.00
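The calculator figures can be reproduced with simple arithmetic. The sketch below assumes the blended $0.90 per 1M tokens rate and a 30-day month (the listed $27.00 is exactly 30 times the daily cost); the function names are hypothetical.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)
CENT = Decimal("0.01")

def daily_cost(tokens_per_day: int, price_per_million: Decimal) -> Decimal:
    """Cost of one day's tokens at a flat per-1M-token price."""
    cost = Decimal(tokens_per_day) / TOKENS_PER_MILLION * price_per_million
    return cost.quantize(CENT)

def monthly_cost(tokens_per_day: int, price_per_million: Decimal,
                 days_per_month: int = 30) -> Decimal:
    """Monthly cost; the 30-day month is an assumption inferred from the page."""
    cost = (Decimal(tokens_per_day) / TOKENS_PER_MILLION
            * price_per_million * days_per_month)
    return cost.quantize(CENT)

BLENDED = Decimal("0.90")  # listed blended price per 1M tokens

print(daily_cost(1_000_000, BLENDED))    # 0.90
print(monthly_cost(1_000_000, BLENDED))  # 27.00
```

At the upper end of the slider, 100M tokens per day, the same functions give $90.00 per day and $2,700.00 per month.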

vs. Similar Models

Z.ai: GLM 4.5V

$0.90 (0% difference)

AlfredPros: CodeLLaMa 7B Instruct Solidity

$0.90 (0% difference)

EleutherAI: Llemma 7b

$0.90 (0% difference)

Morph: Morph V3 Fast

$0.90 (0% difference)

Performance

Context Window: 131K tokens

Larger than 27% of models

Context Window Comparison

DeepSeek: DeepSeek V3.2

131K (same)

OpenAI: gpt-oss-120b

131K (same)

MoonshotAI: Kimi K2 0711

131K (same)

Open Source

Quick Compare

Similar Models

Nemotron 3 Ultra 550B A55B (Reasoning)

NVIDIA

Q: 37.8 · $1.18/1M

Pricier: 31%

NVIDIA Nemotron 3 Super 120B A12B (Reasoning)

NVIDIA

Q: 25.4 · $0.41/1M

Cheaper: 54%

Nemotron Cascade 2 30B A3B

NVIDIA

Q: 21.3 · N/A/1M

Nemotron 3 Nano Omni 30B A3B Reasoning

NVIDIA

Q: 14.9 · $0.13/1M

Cheaper: 85%

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)

NVIDIA

Q: 14.2 · $0.10/1M

Cheaper: 89%

Llama Nemotron Super 49B v1.5 (Reasoning)

NVIDIA

Q: 12.4 · $0.17/1M

Cheaper: 81%
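The "Pricier" and "Cheaper" percentages appear to be relative to this model's $0.90 blended price. The sketch below shows that calculation under that assumption; `relative_price_percent` is a hypothetical helper. Because the listed prices are rounded to cents, a recomputed figure can differ from the displayed one by a point.

```python
from decimal import Decimal

def relative_price_percent(other_price: Decimal, reference_price: Decimal) -> Decimal:
    """Signed percent difference of other_price versus reference_price.

    Positive means pricier than the reference, negative means cheaper.
    """
    return (other_price / reference_price - 1) * 100

REFERENCE = Decimal("0.90")  # assumed baseline: this model's blended price

# Prices as displayed in the list above (rounded to cents).
listed = [
    ("Nemotron 3 Ultra 550B A55B (Reasoning)", Decimal("1.18")),
    ("NVIDIA Nemotron 3 Super 120B A12B (Reasoning)", Decimal("0.41")),
]

for name, price in listed:
    diff = relative_price_percent(price, REFERENCE)
    label = "Pricier" if diff > 0 else "Cheaper"
    print(f"{name}: {label}: {abs(diff):.0f}%")
```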

