NVIDIA: Nemotron 3 Ultra

NVIDIA · Released 2026-06-04

Open Source · 1.0M ctx

Comparison data ready · 55% coverage · 10/10 fields directly observed · Updated Jul 19, 2026, 5:30 PM

About

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Price/1M

$1.35

470th cheapest

93% above median

Top 65%

Context Window

1.0M

62nd largest

Top 21%

Pricing

Input

$0.60

per 1M tokens

Output

$3.60

per 1M tokens

Blended

$1.35

per 1M tokens

Cheaper than 35% of models. Median price is $0.70/1M tokens.
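The blended figure above is consistent with a weighted average of the input and output prices at a 3:1 input-to-output ratio. The page does not state how it blends, so treat that ratio as an assumption. A minimal sketch under that assumption, with hypothetical function names:

```python
# Sketch only: the 3:1 input-to-output weighting is an assumption that
# reproduces the listed blended price; the page does not state the weighting.


def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens."""
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


def percent_above(price: float, reference: float) -> float:
    """How far `price` sits above `reference`, in percent."""
    return (price / reference - 1.0) * 100.0


blended = blended_price(0.60, 3.60)            # listed input and output prices
print(f"blended: ${blended:.2f} per 1M tokens")
print(f"above median: {percent_above(blended, 0.70):.0f}%")  # median listed as $0.70
```

With the listed prices this gives $1.35 and roughly 93% above the $0.70 median, matching the summary figures on this page.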

Cost Calculator

Tokens per day: 1M (range: 100K to 100M)

Daily

$1.35

Monthly

$40.50
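The calculator figures follow from simple arithmetic on the blended price. The sketch below assumes a 30-day month, which is what makes $1.35 per day come out to $40.50. The function names are illustrative and not part of any published API.

```python
# Sketch only: assumes cost scales linearly with tokens at the blended price
# and that a month is 30 days (consistent with $40.50 = 30 x $1.35).

DAYS_PER_MONTH = 30
BLENDED_PRICE_PER_MILLION = 1.35  # USD, from the Pricing section


def daily_cost(tokens_per_day: int,
               price_per_million: float = BLENDED_PRICE_PER_MILLION) -> float:
    """Cost in USD for one day of usage."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(tokens_per_day: int,
                 price_per_million: float = BLENDED_PRICE_PER_MILLION,
                 days: int = DAYS_PER_MONTH) -> float:
    """Cost in USD for `days` days at a constant daily volume."""
    return daily_cost(tokens_per_day, price_per_million) * days


# Walk the calculator's range: 100K, 1M, and 100M tokens per day.
for volume in (100_000, 1_000_000, 100_000_000):
    print(f"{volume:>11,} tokens/day: "
          f"${daily_cost(volume):,.2f} daily, ${monthly_cost(volume):,.2f} monthly")
```

Real bills depend on the actual input/output mix, so the blended price is only an approximation of per-token cost.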

vs. Similar Models

Qwen3.6 27B (Reasoning)

$1.35 (+0%)

Qwen3.5 397B A17B (Reasoning)

$1.35 (+0%)

Qwen3.5 397B A17B (Non-reasoning)

$1.35 (+0%)

Qwen3.6 27B (Non-reasoning)

$1.35 (+0%)

Performance

Context Window

1.0M

tokens

Larger than 79% of models

Max Output

16K

tokens

2% of context
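The "2% of context" figure is the maximum output length as a share of the context window, rounded to a whole percent. The page does not say whether "16K" and "1.0M" are decimal or binary token counts, so the sketch below checks both readings. The function name is hypothetical.

```python
# Sketch only: the exact token counts behind "16K" and "1.0M" are not given,
# so both the decimal and the binary interpretation are shown.


def output_share_percent(max_output_tokens: int, context_tokens: int) -> float:
    """Maximum output length as a percentage of the context window."""
    return max_output_tokens / context_tokens * 100.0


decimal_share = output_share_percent(16_000, 1_000_000)   # 16K / 1.0M, decimal
binary_share = output_share_percent(16 * 1024, 1024 ** 2)  # 16Ki / 1Mi, binary

print(f"decimal reading: {decimal_share:.2f}%")
print(f"binary reading:  {binary_share:.2f}%")
```

Both readings round to 2%, so the listed figure holds either way.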

Context Window Comparison

Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)

1.0M (Same)

Claude Opus 4.8 (Adaptive Reasoning, Max Effort)

1.0M (Same)

Claude Opus 4.7 (Adaptive Reasoning, Max Effort)

1.0M (Same)

Open Source

Quick Compare

Similar Models

Nemotron 3 Ultra 550B A55B (Reasoning)

NVIDIA

Q: 37.8 · $1.18/1M

Cheaper: 13%

NVIDIA Nemotron 3 Super 120B A12B (Reasoning)

NVIDIA

Q: 25.4 · $0.38/1M

Cheaper: 72%

Nemotron Cascade 2 30B A3B

NVIDIA

Q: 21.3 · Price: N/A

Nemotron 3 Nano Omni 30B A3B Reasoning

NVIDIA

Q: 14.9 · $0.13/1M

Cheaper: 90%

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)

NVIDIA

Q: 14.2 · $0.09/1M

Cheaper: 93%

Llama Nemotron Super 49B v1.5 (Reasoning)

NVIDIA

Q: 12.4 · $0.40/1M

Cheaper: 70%
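The "Cheaper" percentages in the list above appear to compare each model's price with this page's $1.35/1M blended price. That baseline is an inference from the numbers, not something the page states. A short sketch under that assumption, with a hypothetical helper name:

```python
# Sketch only: assumes "Cheaper: N%" means (1 - price / baseline) * 100,
# rounded to a whole percent, with the $1.35 blended price as the baseline.
from typing import Optional

BASELINE_PRICE = 1.35  # USD per 1M tokens, blended price listed on this page

# Prices per 1M tokens as listed above; None marks the entry shown as N/A.
similar_models = {
    "Nemotron 3 Ultra 550B A55B (Reasoning)": 1.18,
    "NVIDIA Nemotron 3 Super 120B A12B (Reasoning)": 0.38,
    "Nemotron Cascade 2 30B A3B": None,
    "Nemotron 3 Nano Omni 30B A3B Reasoning": 0.13,
    "NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)": 0.09,
    "Llama Nemotron Super 49B v1.5 (Reasoning)": 0.40,
}


def percent_cheaper(price: Optional[float],
                    baseline: float = BASELINE_PRICE) -> Optional[int]:
    """Whole-percent saving relative to the baseline, or None if unpriced."""
    if price is None:
        return None
    return round((1.0 - price / baseline) * 100.0)


for name, price in similar_models.items():
    saving = percent_cheaper(price)
    label = "no price listed" if saving is None else f"{saving}% cheaper"
    print(f"{name}: {label}")
```

Under this assumption the computed values agree with the listed 13%, 72%, 90%, 93%, and 70%.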

