Qwen: Qwen3.5-122B-A10B
About
The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of...
Related Models
Pricing
Input
$0.26
per 1M tokens
Output
$2.08
per 1M tokens
Blended
$0.72
per 1M tokens
Cheaper than 45% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.72
Monthly
$21.45
vs. Similar Models
Performance
145
tokens/sec
Faster than 71% of models
1.13
seconds
Faster than 49% of models
14.96
seconds
Faster than 32% of models
Market Median
94 tok/s
53% faster
Median TTFT
1.11s
1% slower
Throughput/Dollar
202
tok/s per $/1M
Speed Comparison
Context Window
262K
tokens
Larger than 62% of models
Max Output
262K
tokens
100% of context
Benchmarks
Open Source
779.3K
578
Multi-GPU
8x A100 / H100