Skip to main content
Back to Explore

NVIDIA Nemotron Nano 9B V2 (Reasoning)

NVIDIA·Released 2025-08-18
Open Source

Pricing

Input

$0.04

per 1M tokens

Output

$0.16

per 1M tokens

Blended

$0.07

per 1M tokens

Cheaper than 87% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.07

Monthly

$2.10

vs. Similar Models

Gemini 2.0 Flash-Lite (Feb '25)Q:0.0
$0.13+87%
Hermes 4 - Llama-3.1 405B (Non-reasoning)Q:0.0
$1.50+2043%
Qwen3.5 2B (Non-reasoning)Q:0.0
$0.04-43%
Gemma 4 E4B (Non-reasoning)Q:+0.1
$0.54+667%

Performance

75

tokens/sec

Faster than 37% of models

8.87

seconds

Faster than 12% of models

35.43

seconds

Faster than 13% of models

Market Median

94 tok/s

20% slower

Median TTFT

1.10s

702% slower

Throughput/Dollar

1076

tok/s per $/1M

Speed Comparison

Z.ai: GLM 4.5 Air
75 tok/s-0%
MiniMax: MiniMax M3
75 tok/s+0%
DeepSeek: DeepSeek V4 Pro
75 tok/s+0%

Benchmarks

MMLU-Pro
74.2%
GPQA Diamond
57.0%
HLE
4.6%
LiveCodeBench
72.4%
SciCode
22.0%
TerminalBench Hard
1.5%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
69.7%
IFBench
27.6%
Long Context Recall
21.0%
Tau2
21.9%
Market AverageTop Score

Open Source

Quick Compare

Similar Models

Compare all 7 models