Skip to main content
Back to Explore

Nemotron 3 Nano Omni 30B A3B Reasoning

NVIDIA·Released 2026-04-29
Open Source

Pricing

Input

$0.07

per 1M tokens

Output

$0.30

per 1M tokens

Blended

$0.13

per 1M tokens

Cheaper than 80% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.13

Monthly

$3.93

vs. Similar Models

OpenAI: gpt-oss-20bQ:0.0
$0.06-57%
Mistral: Mistral Medium 3.1Q:-0.1
$0.80+511%
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)Q:+0.2
$0.17+34%
GPT-5 (ChatGPT)Q:+0.4
$3.44+2524%

Performance

298

tokens/sec

Faster than 96% of models

0.59

seconds

Faster than 78% of models

7.30

seconds

Faster than 43% of models

Market Median

94 tok/s

216% faster

Median TTFT

1.11s

47% faster

Throughput/Dollar

2274

tok/s per $/1M

Speed Comparison

Llama 3.1 Nemotron Instruct 70B
301 tok/s+1%
OpenAI: gpt-oss-120b
307 tok/s+3%
Nova Micro
289 tok/s-3%

Benchmarks

MMLU-ProNot evaluated
GPQA Diamond
46.9%
HLE
5.3%
LiveCodeBenchNot evaluated
SciCode
27.8%
TerminalBench Hard
8.3%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
63.2%
Long Context Recall
35.7%
Tau2
45.3%
Market AverageTop Score

Open Source

Quick Compare

Similar Models

Compare all 7 models