About
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Related Models
Nemotron 3 Ultra 550B A55B (Reasoning), 2026-06-04
NVIDIA: Nemotron 3 Ultra (free), 2026-06-04
NVIDIA: Nemotron 3.5 Content Safety (free), 2026-06-04
Nemotron 3 Nano Omni 30B A3B Reasoning, 2026-04-29
NVIDIA: Nemotron 3 Nano Omni (free), 2026-04-28
Nemotron Cascade 2 30B A3B, 2026-03-19
NVIDIA: Nemotron 3 Super, 2026-03-11
NVIDIA: Nemotron 3 Super (free), 2026-03-11
Pricing
Input
$0.50
per 1M tokens
Output
$2.50
per 1M tokens
Blended
$1.00
per 1M tokens
Cheaper than 36% of models. The median price is $0.56 per 1M tokens.
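The $1.00 blended figure is consistent with weighting the input and output prices 3:1. The page does not state how the blend is computed, so the ratio here is an assumption and `blended_price` is a hypothetical helper, shown only as a minimal sketch:

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,   # assumed weighting, not stated on the page
    output_weight: float = 1.0,  # assumed weighting, not stated on the page
) -> float:
    """Weighted average of input and output prices, in USD per 1M tokens."""
    total_weight = input_weight + output_weight
    weighted_sum = input_price * input_weight + output_price * output_weight
    return weighted_sum / total_weight


# Listed prices: $0.50 input, $2.50 output (per 1M tokens).
print(blended_price(0.50, 2.50))  # 1.0
```

A workload with a different input-to-output mix would see a different effective price; passing other weights gives a rough estimate for that mix.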
Cost Calculator
Tokens per day: 1M
Range: 100K to 100M
Daily
$1.00
Monthly
$30.00
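The calculator figures can be reproduced with simple arithmetic. The sketch below assumes the blended $1.00 per 1M tokens rate and a 30-day month, which matches the numbers shown; `estimate_cost` is a hypothetical helper, not part of any API:

```python
def estimate_cost(
    tokens_per_day: int,
    price_per_million: float = 1.00,  # blended price in USD per 1M tokens
    days_per_month: int = 30,         # assumed month length
) -> tuple[float, float]:
    """Return (daily, monthly) cost in USD for a given daily token volume."""
    daily = tokens_per_day / 1_000_000 * price_per_million
    monthly = daily * days_per_month
    return daily, monthly


daily, monthly = estimate_cost(1_000_000)
print(f"${daily:.2f} per day, ${monthly:.2f} per month")
# $1.00 per day, $30.00 per month
```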
vs. Similar Models
GLM-4.7 (Reasoning)
$1.00 (0%)
GLM-4.7 (Non-reasoning)
$1.00 (0%)
GLM-4.6 (Non-reasoning)
$1.00 (0%)
GLM-4.5 (Reasoning)
$1.00 (0%)
Performance
Context Window
1.0M
tokens
Larger than 81% of models
Max Output
16K
tokens
2% of the context window
Context Window Comparison
Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
1.0M (equal)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
1.0M (equal)
Qwen3.7 Max
1.0M (equal)