Loading...
Loading...
The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of the Qwen3.5-122B-A10B.
Quality Index
42.1
31st of 444
Top 7%
Coding Index
34.9
49th of 354
Top 14%
Price/1M
$0.82
450th cheapest
175% above median
Top 66%
Speed
91 tok/s
Top 28%
TTFT
1.32s
Context Window
262K
61st largest
Top 25%
Input
$0.30
per 1M tokens
Output
$2.40
per 1M tokens
Blended
$0.82
per 1M tokens
Cheaper than 34% of models. Median price is $0.30/1M tokens.
Daily
$0.82
Monthly
$24.75
91
tokens/sec
Faster than 72% of models
1.32
seconds
Faster than 21% of models
23.19
seconds
Faster than 12% of models
Market Median
45 tok/s
102% faster
Median TTFT
0.42s
217% slower
Throughput/Dollar
111
tok/s per $/1M
Speed Comparison
Context Window
262K
tokens
Larger than 75% of models
Max Output
66K
tokens
25% of context
2.4M
760
24-48 GB
A6000 / M3 Ultra