NVIDIA: Nemotron 3 Super

NVIDIA · Released 2026-03-11

Open Source · 1.0M ctx

About

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

Price/1M: $0.16 (177th cheapest, 70% below median, top 26%)

Context Window: 1.0M (51st largest, top 20%)

Pricing

Input: $0.08 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.16 per 1M tokens

Cheaper than 74% of models. Median price is $0.54/1M tokens.
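
The listing does not say how the blended price is derived. The figures are consistent with a 3:1 input-to-output token weighting, so the sketch below assumes that weighting; the function name `blended_price` and its default ratio are illustrative, not something the page defines. It also checks the "70% below median" figure against the stated $0.54 median.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted-average price in USD per 1M tokens.

    The 3:1 input:output weighting is an assumption; the listing does not
    state its formula. With the listed prices it yields the listed $0.16.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices, USD per 1M tokens.
price = blended_price(input_price=0.08, output_price=0.40)

# Listed median blended price across models, USD per 1M tokens.
median_price = 0.54
below_median = 1 - price / median_price

print(f"Blended: ${price:.2f} per 1M tokens")
print(f"Below median: {below_median:.0%}")
```

A different traffic mix changes the result noticeably: an output-heavy workload (say 1:1) would come out at $0.24 per 1M tokens under the same formula.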

Cost Calculator

Tokens per day: 1M (adjustable from 100K to 100M)
Daily: $0.16
Monthly: $4.91
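
The calculator's arithmetic can be approximated as follows. This is a rough sketch, not the page's actual formula: the helper names and the 30-day month are assumptions. Using the rounded $0.16 blended price it gives $4.80 per month rather than the listed $4.91, so the page presumably works from unrounded prices or a different month length.

```python
TOKENS_PER_MILLION = 1_000_000


def token_cost(tokens: int, price_per_million: float) -> float:
    """Cost in USD for `tokens` tokens at a given price per 1M tokens."""
    return tokens / TOKENS_PER_MILLION * price_per_million


def split_cost(input_tokens: int, output_tokens: int,
               input_price: float = 0.08, output_price: float = 0.40) -> float:
    """Cost in USD when input and output tokens are billed separately.

    Defaults are the listed per-1M-token prices for this model.
    """
    return (token_cost(input_tokens, input_price)
            + token_cost(output_tokens, output_price))


tokens_per_day = 1_000_000
days_per_month = 30  # assumption; the page does not state its month length

daily = token_cost(tokens_per_day, 0.16)  # rounded blended price
monthly = daily * days_per_month

# Same daily volume, split 3:1 between input and output tokens.
daily_split = split_cost(input_tokens=750_000, output_tokens=250_000)

print(f"Daily: ${daily:.2f}")
print(f"Monthly (30 days): ${monthly:.2f}")
print(f"Daily, 750K in / 250K out: ${daily_split:.2f}")
```

For budgeting, `split_cost` is the safer of the two helpers, because output tokens cost five times as much as input tokens and a blended rate hides that difference.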

vs. Similar Models

Qwen: Qwen3 30B A3B Thinking 2507 · $0.16 · -2%
Meta: Llama 3.3 70B Instruct · $0.15 · -5%
GLM-4.7-Flash (Non-reasoning) · $0.15 · -7%
MiMo-V2.5 · $0.17 · +7%

Performance

Context Window: 1.0M tokens (larger than 80% of models)
Max Output: 16K tokens (2% of context)

Context Window Comparison

Anthropic: Claude Fable 5 · 1.0M · Same
Anthropic: Claude Opus 4.8 · 1.0M · Same
Anthropic: Claude Opus 4.7 · 1.0M · Same

Open Source

Quick Compare

Similar Models

Nemotron 3 Ultra 550B A55B (Reasoning) · NVIDIA · Q: 37.8 · $1.18/1M · Pricier: 618%
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) · NVIDIA · Q: 25.4 · $0.41/1M · Pricier: 152%
Nemotron Cascade 2 30B A3B · NVIDIA · Q: 21.3 · Price: N/A
Nemotron 3 Nano Omni 30B A3B Reasoning · NVIDIA · Q: 14.9 · $0.13/1M · Cheaper: 20%
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) · NVIDIA · Q: 14.2 · $0.10/1M · Cheaper: 41%
Llama Nemotron Super 49B v1.5 (Reasoning) · NVIDIA · Q: 12.4 · $0.17/1M
