
Llama Nemotron Super 49B v1.5 (Reasoning)

NVIDIA·Released 2025-07-25
Open Source

Pricing

Input: $0.10 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.17 per 1M tokens

Cheaper than 72% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day: 1M (adjustable from 100K to 100M)

Daily: $0.17
Monthly: $5.25
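The page does not state how the blended price or the monthly figure is computed. A minimal sketch, assuming a 3:1 input-to-output token weighting and a 30-day month: both are assumptions, chosen because together they reproduce the $5.25 monthly figure and agree with the displayed $0.17 blended price to within display precision. The function names are illustrative only.

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption, not stated on the page.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def usage_cost(tokens_per_day, price_per_million, days=1):
    """Cost in dollars for a steady daily token volume over `days` days."""
    return tokens_per_day / 1_000_000 * price_per_million * days


blended = blended_price(0.10, 0.40)                 # listed input and output prices
daily = usage_cost(1_000_000, blended)              # calculator setting: 1M tokens/day
monthly = usage_cost(1_000_000, blended, days=30)   # 30-day month is an assumption

print(f"blended=${blended:.3f}  daily=${daily:.3f}  monthly=${monthly:.2f}")
```

Under these assumptions the unrounded blended price is $0.175 per 1M tokens, which the page appears to display as $0.17.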

vs. Similar Models

Devstral Medium (Q: 0.0): $0.80, +357%
Mistral Small 4 (Non-reasoning) (Q: 0.0): $0.26, +50%
Gemma 4 E4B (Reasoning) (Q: +0.1): $0.54, +207%
Mistral: Mistral Medium 3 (Q: +0.1): $0.80, +357%
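The percentage next to each similar model appears to be its price premium over this model's blended price. A hedged sketch, assuming the baseline is the unrounded blended price of $0.175 per 1M tokens (an inference from the other figures on the page, not something it states); `price_premium_pct` is a hypothetical helper name.

```python
def price_premium_pct(other_price, base_price):
    """Percentage by which other_price exceeds base_price."""
    return (other_price / base_price - 1) * 100


BASE_BLENDED = 0.175  # assumed unrounded blended price, $ per 1M tokens

# Listed prices of the comparison models, $ per 1M tokens
similar_models = {
    "Devstral Medium": 0.80,
    "Mistral Small 4 (Non-reasoning)": 0.26,
    "Gemma 4 E4B (Reasoning)": 0.54,
    "Mistral: Mistral Medium 3": 0.80,
}

for name, price in similar_models.items():
    print(f"{name}: {price_premium_pct(price, BASE_BLENDED):+.0f}%")
```

This reproduces the +357% entries exactly. The other two land within a couple of points of the listed values, presumably because the page computes the percentage from unrounded prices.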

Performance

51 tokens/sec: faster than 17% of models
0.27 seconds (time to first token): faster than 98% of models
39.43 seconds: faster than 11% of models

Market Median: 94 tok/s (this model is 46% slower)
Median TTFT: 1.11s (this model is 76% faster)

Throughput/Dollar: 292 tok/s per $/1M
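The relative figures in this section can be checked from the listed numbers. A minimal sketch, assuming that "slower" and "faster" both mean the percentage by which this model's value sits below the market median, and that throughput per dollar is output speed divided by the blended price (taken here as $0.175 per 1M tokens, an assumption). The helper names are illustrative.

```python
def percent_below(value, reference):
    """Percentage by which value is lower than reference."""
    return (1 - value / reference) * 100


def throughput_per_dollar(tokens_per_sec, price_per_million):
    """Output speed divided by price, in tok/s per ($ per 1M tokens)."""
    return tokens_per_sec / price_per_million


speed_gap = percent_below(51, 94)         # tok/s vs. market median speed
ttft_gap = percent_below(0.27, 1.11)      # seconds vs. median time to first token
tpd = throughput_per_dollar(51, 0.175)    # 0.175 is the assumed blended price

print(f"speed: {speed_gap:.0f}% slower, TTFT: {ttft_gap:.0f}% faster, "
      f"throughput/dollar: {tpd:.1f}")
```

The two percentages round to the listed 46% and 76%. The throughput-per-dollar result comes out slightly under the listed 292, which would be consistent with the page using an unrounded speed a little above 51 tok/s.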

Speed Comparison

Anthropic: Claude Opus 4.7, 51 tok/s (-0%)
Kimi K2.6 (Non-reasoning), 51 tok/s (+1%)
Claude 4.5 Sonnet (Reasoning), 52 tok/s (+1%)

Benchmarks

MMLU-Pro: 81.4%
GPQA Diamond: 74.8%
HLE: 6.8%
LiveCodeBench: 73.7%
SciCode: 34.8%
TerminalBench Hard: 5.3%
MATH-500: 98.3%
AIME: 86.0%
AIME 2025: 76.7%
IFBench: 37.0%
Long Context Recall: 34.0%
Tau2: 28.1%