Related Models
NVIDIA Nemotron 3 Nano 4B2026-03-16NVIDIA Nemotron 3 Super 120B A12B (Reasoning)2026-03-11NVIDIA Nemotron 3 Super 120B A12B BF162026-03-10NVIDIA Nemotron 3 Nano 4B BF162026-03-07NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)2025-12-15NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)2025-12-15NVIDIA Nemotron 3 Nano 30B A3B BF162025-12-04NVIDIA Nemotron Parse v1.12025-11-15
Pricing
Input
$0.04
per 1M tokens
Output
$0.16
per 1M tokens
Blended
$0.07
per 1M tokens
Cheaper than 87% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.07
Monthly
$2.10
vs. Similar Models
Gemini 2.0 Flash-Lite (Feb '25)Q:0.0
$0.13+87%
Hermes 4 - Llama-3.1 405B (Non-reasoning)Q:0.0
$1.50+2043%
Qwen3.5 2B (Non-reasoning)Q:0.0
$0.04-43%
Gemma 4 E4B (Non-reasoning)Q:+0.1
$0.54+667%
Performance
75
tokens/sec
Faster than 37% of models
8.87
seconds
Faster than 12% of models
35.43
seconds
Faster than 13% of models
Market Median
94 tok/s
20% slower
Median TTFT
1.10s
702% slower
Throughput/Dollar
1076
tok/s per $/1M
Speed Comparison
Z.ai: GLM 4.5 Air
75 tok/s-0%
MiniMax: MiniMax M3
75 tok/s+0%
DeepSeek: DeepSeek V4 Pro
75 tok/s+0%
Benchmarks
MMLU-Pro
74.2%
GPQA Diamond
57.0%
HLE
4.6%
LiveCodeBench
72.4%
SciCode
22.0%
TerminalBench Hard
1.5%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
69.7%
IFBench
27.6%
Long Context Recall
21.0%
Tau2
21.9%
Market AverageTop Score