About
The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...
Related Models
Pricing
Input
$0.20
per 1M tokens
Output
$1.56
per 1M tokens
Blended
$0.54
per 1M tokens
Cheaper than 50% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.54
Monthly
$16.09
vs. Similar Models
Performance
84
tokens/sec
Faster than 43% of models
1.41
seconds
Faster than 36% of models
25.25
seconds
Faster than 20% of models
Market Median
94 tok/s
11% slower
Median TTFT
1.11s
27% slower
Throughput/Dollar
156
tok/s per $/1M
Speed Comparison
Context Window
262K
tokens
Larger than 62% of models
Max Output
66K
tokens
25% of context
Benchmarks
Open Source
2.6M
996
24-48 GB
A6000 / M3 Ultra