NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

NVIDIA·Released 2025-10-10

Open Source49B131K ctxother

About

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Price/1M

$0.40

304th cheapest

26% below median

Top 45%

Context Window

131K

236th largest

Top 73%

Pricing

Input

$0.40

per 1M tokens

Output

$0.40

per 1M tokens

Blended

$0.40

per 1M tokens

Cheaper than 55% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.40

Monthly

$12.00

vs. Similar Models

Meta: Llama 3.1 70B Instruct

$0.400%

TheDrummer: UnslopNemo 12B

$0.400%

Qwen3 4B (Reasoning)

$0.40-0%

Qwen3 1.7B (Reasoning)

$0.40-0%

Performance

Context Window

131K

tokens

Larger than 27% of models

Max Output

16K

tokens

13% of context

Context Window Comparison

DeepSeek: DeepSeek V3.2

131KSame

OpenAI: gpt-oss-120b

131KSame

MoonshotAI: Kimi K2 0711

131KSame

Open Source

View model repository

other49BGGUF / GPTQ / AWQ

Downloads

821.3K

Likes

233

VRAM (FP16)

48-80 GB

GPU

A100 80GB

Quick Compare

Similar Models

Nemotron 3 Ultra 550B A55B (Reasoning)

NVIDIA

Q: 37.8$1.18/1M

Pricier: 194%

NVIDIA Nemotron 3 Super 120B A12B (Reasoning)

NVIDIA

Q: 25.4$0.41/1M

Nemotron Cascade 2 30B A3B

NVIDIA

Q: 21.3N/A/1M

Nemotron 3 Nano Omni 30B A3B Reasoning

NVIDIA

Q: 14.9$0.13/1M

Cheaper: 67%

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)

NVIDIA

Q: 14.2$0.10/1M

Cheaper: 76%

Llama Nemotron Super 49B v1.5 (Reasoning)

NVIDIA

Q: 12.4$0.17/1M

Cheaper: 56%

Compare all 7 models

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

About

Related Models

Pricing

Cost Calculator

vs. Similar Models

Performance

Open Source

Quick Compare

Similar Models