Voltar para Explorar

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

NVIDIA·Lançado em 2025-04-08

Open Source131K ctx

Comparar Testar modelo

Sobre

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

Modelos Relacionados

NVIDIA: Llama 3.3 Nemotron Super 49B V1.52025-10-10 Llama Nemotron Super 49B v1.5 (Reasoning)2025-07-25 Llama Nemotron Super 49B v1.5 (Non-reasoning)2025-07-25 Llama 3.1 Nemotron Nano VL 8B V12025-06-03 Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)2025-05-20 Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)2025-04-07 Llama 3.3 Nemotron Super 49B v1 (Reasoning)2025-03-18 Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)2025-03-18

Preços

Entrada

$0.60

por 1M tokens

Saída

$1.80

por 1M tokens

Combinado

$0.90

por 1M tokens

Mais barato que 38% dos modelos. Preço mediano é $0.54/1M tokens.

Calculadora de Custo

Tokens por dia1M

100K100M

Diário

$0.90

Mensal

$27.00

vs. Modelos Similares

Z.ai: GLM 4.5V

$0.900%

AlfredPros: CodeLLaMa 7B Instruct Solidity

$0.900%

EleutherAI: Llemma 7b

$0.900%

Morph: Morph V3 Fast

$0.900%

Desempenho

Janela de Contexto

131K

tokens

Maior que 27% dos modelos

Comparação de Janela de Contexto

DeepSeek V3.2

131KIgual

gpt oss 120b

131KIgual

MoonshotAI: Kimi K2 0711

131KIgual

Open Source

Comparação Rápida

Modelos Similares

Nemotron 3 Ultra 550B A55B (Reasoning)

NVIDIA

Q: 37.8$1.18/1M

NVIDIA Nemotron 3 Super 120B A12B (Reasoning)

NVIDIA

Q: 25.4$0.41/1M

Mais barato: 54%

Nemotron Cascade 2 30B A3B

NVIDIA

Nemotron 3 Nano Omni 30B A3B Reasoning

NVIDIA

Q: 14.9$0.13/1M

Mais barato: 85%

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)

NVIDIA

Q: 14.2$0.10/1M

Mais barato: 89%

Llama Nemotron Super 49B v1.5 (Reasoning)

NVIDIA

Q: 12.4$0.17/1M

Mais barato: 81%

Comparar todos os 7 modelos