Qwen: Qwen3.5-35B-A3B
About
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall...
Pricing
Input: $0.14 per 1M tokens
Output: $1.00 per 1M tokens
Blended: $0.35 per 1M tokens
Cheaper than 58% of models. Median price is $0.54/1M tokens.
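The listing does not say how the blended price is derived from the input and output prices. A minimal sketch, assuming a 3:1 input-to-output token ratio (a common convention, not confirmed by the source), gives about $0.355 per 1M tokens, which is close to the listed $0.35. The function names here are illustrative only.

```python
# Prices in USD per 1M tokens, taken from the listing above.
INPUT_PRICE = 0.14
OUTPUT_PRICE = 1.00


def blended_price(input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens.

    The 3:1 default ratio is an assumption; the listing does not state it.
    """
    total_parts = input_parts + output_parts
    return (input_parts * INPUT_PRICE + output_parts * OUTPUT_PRICE) / total_parts


def request_cost(input_tokens, output_tokens):
    """USD cost of a single request at the listed per-token prices."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000
```

For a real workload, `request_cost` with measured token counts is likely a better guide than the blended figure, since output tokens cost roughly seven times as much as input tokens here.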
Cost Calculator
Daily: $0.35
Monthly: $10.65
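The calculator inputs are not shown, but the two figures are consistent with 1M tokens per day at the blended price and an average month of 365.25 / 12 days. The sketch below reproduces them under those assumptions; both the usage level and the month length are guesses, and the helper names are hypothetical.

```python
BLENDED_PRICE = 0.35             # USD per 1M tokens, from the listing
DAYS_PER_MONTH = 365.25 / 12     # assumed average month length (30.4375 days)


def daily_cost(tokens_per_day, price_per_million=BLENDED_PRICE):
    """USD per day for a given daily token volume."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(cost_per_day):
    """USD per month, scaling a daily cost by the assumed month length."""
    return cost_per_day * DAYS_PER_MONTH
```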
Performance
Throughput: 163 tokens/sec (faster than 79% of models)
Time to first token: 1.15 seconds (faster than 47% of models)
13.40 seconds (faster than 35% of models)
Market median throughput: 94 tok/s (this model is 73% faster)
Median TTFT: 1.11s (this model is 3% slower)
Throughput/Dollar: 460 tok/s per $/1M
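The relative figures above can be checked with simple arithmetic, and the throughput and time-to-first-token numbers can be combined into a rough response-time estimate. This is a sketch under a simplifying assumption: generation proceeds at a constant 163 tokens/sec after a 1.15 second wait for the first token. Real latency varies with load and prompt length.

```python
# Figures from the listing above.
THROUGHPUT_TOK_S = 163.0
TTFT_S = 1.15
MEDIAN_THROUGHPUT_TOK_S = 94.0


def percent_faster(value, baseline):
    """How much larger `value` is than `baseline`, in percent."""
    return (value / baseline - 1.0) * 100.0


def estimated_response_seconds(output_tokens):
    """Rough end-to-end time: first-token wait plus steady-state generation.

    Assumes constant throughput, which is a simplification.
    """
    return TTFT_S + output_tokens / THROUGHPUT_TOK_S
```

Under this model a response of about 2,000 tokens takes roughly 13.4 seconds, which is in the neighborhood of the unlabeled 13.40 seconds figure, though the source does not state what that figure measures.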
Context Window: 262K tokens (larger than 62% of models)
Max Output: 82K tokens (31% of context)
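When planning requests it can help to compute how many output tokens remain for a given prompt size. The sketch below uses the rounded figures from the listing (the exact limits are probably slightly different) and assumes that prompt and output tokens share the same context window, which the source does not confirm.

```python
CONTEXT_WINDOW = 262_000   # rounded "262K" from the listing; exact value not given
MAX_OUTPUT = 82_000        # rounded "82K" from the listing; exact value not given


def output_budget(prompt_tokens):
    """Largest output length that fits alongside a prompt of the given size.

    Assumes prompt and output tokens count against one shared window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))
```

With these numbers, prompts up to about 180K tokens still leave room for the full output limit; beyond that the available output shrinks token for token.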
Open Source
2.0M
1.4K
48-80 GB
A100 80GB