Qwen: Qwen3.5 397B A17B
About
The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers...
Related Models
Pricing
Input
$0.39
per 1M tokens
Output
$2.45
per 1M tokens
Blended
$0.90
per 1M tokens
Cheaper than 37% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.90
Monthly
$27.04
vs. Similar Models
Performance
52
tokens/sec
Faster than 18% of models
1.68
seconds
Faster than 30% of models
63.15
seconds
Faster than 4% of models
Market Median
94 tok/s
45% slower
Median TTFT
1.11s
51% slower
Throughput/Dollar
58
tok/s per $/1M
Speed Comparison
Context Window
256K
tokens
Larger than 58% of models
Max Output
64K
tokens
25% of context
Benchmarks
Open Source
590.0K
1.5K
Multi-GPU
8x A100 / H100