Related Models
NVIDIA Nemotron 3 Nano 4B2026-03-16NVIDIA Nemotron 3 Super 120B A12B (Reasoning)2026-03-11NVIDIA Nemotron 3 Super 120B A12B BF162026-03-10NVIDIA Nemotron 3 Nano 4B BF162026-03-07NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)2025-12-15NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)2025-12-15NVIDIA Nemotron 3 Nano 30B A3B BF162025-12-04NVIDIA Nemotron Parse v1.12025-11-15
Pricing
Input
$0.20
per 1M tokens
Output
$0.60
per 1M tokens
Blended
$0.30
per 1M tokens
Cheaper than 62% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.30
Monthly
$9.00
vs. Similar Models
ERNIE 4.5 300B A47BQ:0.0
$0.48+62%
Hermes 4 - Llama-3.1 405B (Reasoning)Q:0.0
$1.50+400%
Ministral 3 8BQ:0.0
$0.15-50%
GLM-4.5V (Reasoning)Q:+0.1
$0.90+200%
Performance
266
tokens/sec
Faster than 95% of models
0.21
seconds
Faster than 99% of models
7.74
seconds
Faster than 43% of models
Market Median
94 tok/s
181% faster
Median TTFT
1.11s
81% faster
Throughput/Dollar
885
tok/s per $/1M
Speed Comparison
gpt-oss-20B (low)
265 tok/s-0%
Gemini 2.5 Flash-Lite (Reasoning)
269 tok/s+1%
Sarvam 30B (high)
243 tok/s-8%
Benchmarks
MMLU-Pro
75.9%
GPQA Diamond
57.2%
HLE
5.3%
LiveCodeBench
69.4%
SciCode
26.2%
TerminalBench Hard
4.5%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
75.0%
IFBench
31.9%
Long Context Recall
40.0%
Tau2
21.3%
Market AverageTop Score