Qwen3.5 27B is a native vision-language dense model that incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...
Input: $0.20 per 1M tokens
Output: $1.56 per 1M tokens
Blended: $0.54 per 1M tokens
Cheaper than 51% of models. Median price is $0.56/1M tokens.
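The blended price is consistent with a weighted average of the input and output prices. The page does not state the weighting, so the sketch below assumes a 3:1 input-to-output token ratio; that ratio and the function name `blended_price` are illustrative assumptions, not something the page confirms.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 input:output weighting is an assumption made for
    illustration; the source page does not document its formula.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Prices per 1M tokens as listed on the page.
price = blended_price(0.20, 1.56)
print(f"${price:.2f} per 1M tokens")  # prints "$0.54 per 1M tokens"
```

Under that assumed weighting the listed input and output prices reproduce the $0.54 blended figure; a workload with a different input/output mix would see a different effective price.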
Daily: $0.54
Monthly: $16.09
Speed: 92 tokens/sec, faster than 54% of models
TTFT: 1.42 seconds, faster than 34% of models
23.05 seconds, faster than 22% of models
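The speed and time-to-first-token figures above can be combined into a rough end-to-end estimate for a single response. This is a minimal sketch: the formula (TTFT plus output tokens divided by throughput) is a common approximation, the 500-token response length is an arbitrary assumption, and the page does not say what output length its own timing figures assume.

```python
def estimated_response_seconds(ttft_s: float, tokens_per_sec: float,
                               output_tokens: int) -> float:
    """Approximate wall-clock time for one streamed response.

    Assumes a constant generation rate after the first token, which real
    deployments only approximate.
    """
    if tokens_per_sec <= 0:
        raise ValueError("tokens_per_sec must be positive")
    return ttft_s + output_tokens / tokens_per_sec


# 1.42 s TTFT and 92 tokens/sec come from the page; 500 tokens is hypothetical.
seconds = estimated_response_seconds(1.42, 92, 500)
print(f"about {seconds:.1f} s for a 500-token reply")
```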
Market median speed: 86 tok/s (this model is 8% faster)
Median TTFT: 1.07s (this model is 32% slower)
Throughput/Dollar: 172 tok/s per $/1M
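The comparisons against the market median and the throughput-per-dollar figure are simple ratios. The sketch below recomputes them from the rounded values shown on the page; the function names are illustrative, and because the inputs are rounded the results land near, not exactly on, the displayed 8%, 32%, and 172, which are presumably derived from unrounded data.

```python
def percent_difference(value: float, median: float) -> float:
    """Signed difference of value relative to median, in percent."""
    return (value - median) / median * 100


def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens per second obtained per dollar of blended price per 1M tokens."""
    return tokens_per_sec / price_per_million


# Rounded figures taken from the page.
speed_gap = percent_difference(92, 86)      # positive means faster than median
ttft_gap = percent_difference(1.42, 1.07)   # positive means slower than median
value = throughput_per_dollar(92, 0.54)

print(f"speed vs median: {speed_gap:+.1f}%")
print(f"TTFT vs median:  {ttft_gap:+.1f}%")
print(f"throughput/dollar: {value:.0f} tok/s per $/1M")
```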
Context Window: 262K tokens, larger than 66% of models
Max Output: 66K tokens (25% of context)
3.3M
970
24-48 GB
A6000 / M3 Ultra
Quality Index: 42.1 (58th of 507, top 12%)
Coding Index: 34.9 (83rd of 417, top 20%)
Price/1M: $0.54 (315th cheapest, 4% below median, top 49%)
Speed: 92 tok/s (top 46%)
TTFT: 1.42s
Context Window: 262K (91st largest, top 34%)