Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
Input: $0.10 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.17 per 1M tokens
Cheaper than 73% of models. Median price is $0.56/1M tokens.
Daily: $0.17
Monthly: $5.25
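
The listing does not say how the blended, daily, or monthly figures are computed. The sketch below shows one weighting that reproduces them, under two loudly stated assumptions: a 3:1 input-to-output token mix for the blended price, and a hypothetical workload of 1M tokens per day over a 30-day month. The function name `blended_price` and both constants marked "assumed" are illustrative, not taken from the page.

```python
from decimal import Decimal

# Prices copied from the listing, in USD per 1M tokens.
INPUT_PRICE = Decimal("0.10")
OUTPUT_PRICE = Decimal("0.40")
MEDIAN_PRICE = Decimal("0.56")

# Assumptions (not stated on the page): token mix and daily volume.
ASSUMED_INPUT_WEIGHT = 3
ASSUMED_OUTPUT_WEIGHT = 1
ASSUMED_MILLION_TOKENS_PER_DAY = Decimal("1")
ASSUMED_DAYS_PER_MONTH = 30


def blended_price(input_price, output_price, input_weight, output_weight):
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_weight = input_weight + output_weight
    weighted = input_price * input_weight + output_price * output_weight
    return weighted / total_weight


blended = blended_price(
    INPUT_PRICE, OUTPUT_PRICE, ASSUMED_INPUT_WEIGHT, ASSUMED_OUTPUT_WEIGHT
)
# Fraction by which the blended price sits below the listed median.
below_median = (MEDIAN_PRICE - blended) / MEDIAN_PRICE
daily = blended * ASSUMED_MILLION_TOKENS_PER_DAY
monthly = daily * ASSUMED_DAYS_PER_MONTH

print(f"blended per 1M tokens: ${blended}")
print(f"below median: {below_median:.1%}")
print(f"daily: ${daily}, monthly: ${monthly:.2f}")
```

Under these assumptions the blended price comes out at $0.175 per 1M tokens, which would explain a displayed $0.17, a monthly figure of $5.25, and a gap of roughly 69% to the $0.56 median. A different token mix or daily volume would change all three numbers.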
Context Window: 131K tokens (larger than 33% of models)
Max Output: 16K tokens (13% of context)
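
As a rough check on these limits, the sketch below derives the output share and the prompt budget left once the full output allowance is reserved. It assumes "K" means 1,024 tokens (so 128K = 131,072, matching the 131K shown) and that output tokens count against the same context window; neither is confirmed by the listing. The helper name `max_prompt_tokens` is hypothetical.

```python
# Assumed exact sizes: "K" taken as 1,024 tokens.
CONTEXT_WINDOW = 128 * 1024  # 131,072 tokens
MAX_OUTPUT = 16 * 1024       # 16,384 tokens


def max_prompt_tokens(context_window: int, reserved_output: int) -> int:
    """Tokens left for the prompt after reserving room for the reply.

    Assumes prompt and output share one context window.
    """
    if reserved_output < 0 or reserved_output > context_window:
        raise ValueError("reserved_output must be within the context window")
    return context_window - reserved_output


output_share = MAX_OUTPUT / CONTEXT_WINDOW
prompt_budget = max_prompt_tokens(CONTEXT_WINDOW, MAX_OUTPUT)

print(f"output share of context: {output_share:.1%}")
print(f"prompt budget with full output reserved: {prompt_budget} tokens")
```

With these sizes the output share is 12.5%, which the page rounds to 13%, and 114,688 tokens remain for the prompt when the full output allowance is reserved.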
Context Window Comparison
Price/1M: $0.17 (161st cheapest, 69% below median, top 27%)
Context Window: 131K (201st largest, top 67%)