Loading...
Loading...
Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per token, delivering performance comparable to models with 10 to 20x higher active compute, which makes it well suited for cost-sensitive, always-on agent deployment. The model is trained with a strong agentic focus and performs reliably on long-horizon coding tasks, complex tool usage, and recovery from execution failures. With a native 256k context window, it integrates cleanly into real-world CLI and IDE environments and adapts well to common agent scaffolds used by modern coding tools. The model operates exclusively in non-thinking mode and does not emit <think> blocks, simplifying integration for production coding agents.
Índice de Qualidade
28.3
113th de 444
Top 26%
Índice de Código
22.9
127th de 354
Top 36%
Preço/1M
$0.60
409th mais barato
100% acima da mediana
Top 60%
Velocidade
149 tok/s
Top 13%
TTFT
0.87s
Janela de Contexto
262K
61st maior
Top 25%
Entrada
$0.35
por 1M tokens
Saída
$1.20
por 1M tokens
Combinado
$0.60
por 1M tokens
Mais barato que 40% dos modelos. Preço mediano é $0.30/1M tokens.
Diário
$0.60
Mensal
$18.00
149
tokens/seg
Mais rápido que 87% dos modelos
0.87
segundos
Mais rápido que 34% dos modelos
0.87
segundos
Mais rápido que 41% dos modelos
Mediana do Mercado
45 tok/s
228% mais rápido
TTFT Mediano
0.42s
108% mais lento
Vazão/Dólar
248
tok/s por $/1M
Comparação de Velocidade
Janela de Contexto
262K
tokens
Maior que 75% dos modelos
Saída Máxima
66K
tokens
25% do contexto