Loading...
Loading...
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...
Input
$0.09
per 1M tokens
Output
$1.10
per 1M tokens
Blended
$0.34
per 1M tokens
Cheaper than 60% of models. Median price is $0.56/1M tokens.
Daily
$0.34
Monthly
$10.28
154
tokens/sec
Faster than 79% of models
1.10
seconds
Faster than 48% of models
1.10
seconds
Faster than 65% of models
Market Median
86 tok/s
79% faster
Median TTFT
1.07s
3% slower
Throughput/Dollar
449
tok/s per $/1M
Speed Comparison
Context Window
262K
tokens
Larger than 66% of models
Max Output
16K
tokens
6% of context
777.1K
951
Multi-GPU
8x A100 / H100
Quality Index
20.1
242nd of 507
Top 48%
Coding Index
15.3
245th of 417
Top 59%
Math Index
66.3
104th of 269
Top 39%
Price/1M
$0.34
252nd cheapest
39% below median
Top 40%
Speed
154 tok/s
Top 21%
TTFT
1.10s
Context Window
262K
91st largest
Top 34%