Related Models
Nemotron 3 Ultra 550B A55B (Reasoning), 2026-06-04
NVIDIA: Nemotron 3 Ultra, 2026-06-04
NVIDIA: Nemotron 3 Ultra (free), 2026-06-04
NVIDIA: Nemotron 3.5 Content Safety (free), 2026-06-04
NVIDIA: Nemotron 3 Nano Omni (free), 2026-04-28
Nemotron Cascade 2 30B A3B, 2026-03-19
NVIDIA: Nemotron 3 Super, 2026-03-11
NVIDIA: Nemotron 3 Super (free), 2026-03-11
Pricing
Input
$0.07
per 1M tokens
Output
$0.30
per 1M tokens
Blended
$0.13
per 1M tokens
Cheaper than 80% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day: 1M (adjustable from 100K to 100M)
Daily
$0.13
Monthly
$3.93
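The calculator figures above can be approximated with a short sketch. This is an illustration only: the function name `estimate_cost` and the 30-day month are assumptions, not something the page states. Using the rounded blended price of $0.13 gives a monthly figure of $3.90 rather than the displayed $3.93, which suggests the page works from an unrounded blended price.

```python
def estimate_cost(tokens_per_day: float,
                  price_per_million: float,
                  days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in dollars.

    price_per_million is the blended price per 1M tokens.
    days_per_month=30 is an assumption, not taken from the page.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    monthly = daily * days_per_month
    return daily, monthly


# 1M tokens per day at the displayed (rounded) blended price of $0.13
daily, monthly = estimate_cost(1_000_000, 0.13)
print(f"Daily: ${daily:.2f}  Monthly: ${monthly:.2f}")
```

Changing the first argument to any value in the 100K to 100M range reproduces the other calculator settings under the same assumptions.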
vs. Similar Models
OpenAI: gpt-oss-20b (Q: 0.0)
$0.06 (-57%)
Mistral: Mistral Medium 3.1 (Q: -0.1)
$0.80 (+511%)
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) (Q: +0.2)
$0.17 (+34%)
GPT-5 (ChatGPT) (Q: +0.4)
$3.44 (+2524%)
Performance
298
tokens/sec
Faster than 96% of models
0.59
seconds (time to first token)
Faster than 78% of models
7.30
seconds
Faster than 43% of models
Market Median
94 tok/s
216% faster
Median TTFT
1.11s
47% faster
Throughput/Dollar
2274
tok/s per $/1M
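The relative figures in this section can be roughly reproduced from the displayed values. The sketch below shows one plausible reading of how they are derived; the function names and formulas are assumptions, not the page's documented method. With the rounded inputs shown on the page, the results land close to, but not exactly on, the displayed 216% and 2274, which is consistent with the page using unrounded values.

```python
def percent_faster_throughput(model_tps: float, median_tps: float) -> float:
    """Relative throughput gain over the median, in percent (higher is faster)."""
    return (model_tps - median_tps) / median_tps * 100


def percent_faster_latency(model_s: float, median_s: float) -> float:
    """Relative latency reduction versus the median, in percent (lower is faster)."""
    return (median_s - model_s) / median_s * 100


def throughput_per_dollar(tps: float, price_per_million: float) -> float:
    """Tokens/sec divided by the blended price per 1M tokens (assumed definition)."""
    return tps / price_per_million


# Inputs are the rounded values displayed on the page
print(f"{percent_faster_throughput(298, 94):.0f}% faster than median throughput")
print(f"{percent_faster_latency(0.59, 1.11):.0f}% faster than median TTFT")
print(f"{throughput_per_dollar(298, 0.13):.0f} tok/s per $/1M")
```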
Speed Comparison
Llama 3.1 Nemotron Instruct 70B
301 tok/s (+1%)
OpenAI: gpt-oss-120b
307 tok/s (+3%)
Nova Micro
289 tok/s (-3%)
Benchmarks
MMLU-Pro
Not evaluated
GPQA Diamond
46.9%
HLE
5.3%
LiveCodeBench
Not evaluated
SciCode
27.8%
TerminalBench Hard
8.3%
MATH-500
Not evaluated
AIME
Not evaluated
AIME 2025
Not evaluated
IFBench
63.2%
Long Context Recall
35.7%
Tau2
45.3%