
Qwen: Qwen3.5-27B

Alibaba · Released 2026-02-25
Open Source · 27B · 262K ctx · Apache 2.0 · Multimodal

About

Qwen3.5 27B is a native vision-language dense model that incorporates a linear attention mechanism, balancing inference speed and performance to deliver fast response times. Its overall capabilities are comparable to those of...

Pricing

Input

$0.20

per 1M tokens

Output

$1.56

per 1M tokens

Blended

$0.54

per 1M tokens

Cheaper than 50% of models. Median price is $0.54/1M tokens.
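
The blended figure above is consistent with a 3:1 input-to-output weighting of the two listed prices. That weighting is an assumption (the page does not state how it blends), and `blended_price` is a hypothetical helper name. A minimal sketch:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price in USD per 1M tokens.

    The 3:1 input:output default is an assumption, not something the
    listing states; it happens to reproduce the displayed blended price.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices in USD per 1M tokens.
blended = blended_price(0.20, 1.56)
print(f"${blended:.2f} per 1M tokens")
```

Workloads that generate far more output than input will see an effective price closer to the output rate, so the weights are worth adjusting to match real traffic.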

Cost Calculator

Tokens per day: 1M
Range: 100K to 100M

Daily

$0.54

Monthly

$16.09
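
The calculator's arithmetic can be approximated as daily token volume times the blended price. This sketch assumes a 30-day month and uses the rounded $0.54 price shown on the page; `usage_cost` is a hypothetical helper. The page's monthly figure of $16.09 is slightly below 30 × $0.54, which suggests the calculator works from an unrounded price (about $0.536 if a 30-day month is assumed), so the sketch is not expected to match it to the cent.

```python
def usage_cost(tokens_per_day: int, price_per_million: float, days: int = 1) -> float:
    """Cost in USD for a steady daily token volume over `days` days."""
    return tokens_per_day / 1_000_000 * price_per_million * days


# 1M tokens per day at the displayed (rounded) blended price.
daily = usage_cost(1_000_000, 0.54)
monthly = usage_cost(1_000_000, 0.54, days=30)  # 30-day month is an assumption
print(f"daily ${daily:.2f}, 30 days ${monthly:.2f}")
```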

vs. Similar Models

Z.ai: GLM 4.7 · Q: 0.0 · $0.74 · +38%
MiniMax: MiniMax M2.5 · Q: -0.1 · $0.21 · -61%
Qwen: Qwen3.5 397B A17B · Q: -0.1 · $0.90 · +68%
Claude 4.1 Opus (Reasoning) · Q: -0.1 · $30.00 · +5494%

Performance

84

tokens/sec

Faster than 43% of models

1.41

seconds (time to first token)

Faster than 36% of models

25.25

seconds

Faster than 20% of models

Market Median

94 tok/s

11% slower

Median TTFT

1.11s

27% slower

Throughput/Dollar

156

tok/s per $/1M
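
The derived figures in this section follow from simple ratios of the numbers shown. The sketch below reproduces them under two assumptions: that the 1.41-second figure is time to first token (it matches the "27% slower" comparison against the median TTFT), and that throughput per dollar divides by the rounded blended price. The function names are hypothetical.

```python
def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens per second delivered per USD of blended price per 1M tokens."""
    return tokens_per_sec / price_per_million


def pct_slower_throughput(value: float, median: float) -> float:
    """Percent by which a throughput falls short of the median (higher is better)."""
    return (median - value) / median * 100


def pct_slower_latency(value: float, median: float) -> float:
    """Percent by which a latency exceeds the median (lower is better)."""
    return (value - median) / median * 100


print(f"{throughput_per_dollar(84, 0.54):.0f} tok/s per $/1M")
print(f"{pct_slower_throughput(84, 94):.0f}% slower than median throughput")
print(f"{pct_slower_latency(1.41, 1.11):.0f}% slower than median TTFT")
```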

Speed Comparison

DeepSeek V4 Pro (Non-reasoning) · 84 tok/s · -0%
DeepSeek: DeepSeek V4 Pro · 84 tok/s · -0%
Mistral Medium · 85 tok/s · +1%

Context Window

262K

tokens

Larger than 62% of models

Max Output

66K

tokens

25% of context
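
When planning requests, it can help to budget the prompt against these two limits. The sketch below assumes that prompt and output tokens share the same context window, which is common but not stated on this page, and it uses the rounded 262K and 66K figures as displayed; the exact limits may differ. `max_prompt_tokens` is a hypothetical helper.

```python
CONTEXT_WINDOW = 262_000  # displayed as "262K"; exact limit may differ
MAX_OUTPUT = 66_000       # displayed as "66K"; exact limit may differ


def max_prompt_tokens(requested_output: int,
                      context_window: int = CONTEXT_WINDOW,
                      max_output: int = MAX_OUTPUT) -> int:
    """Largest prompt that still leaves room for `requested_output` tokens.

    Assumes prompt and output are counted against one shared window.
    """
    if requested_output < 0:
        raise ValueError("requested_output must be non-negative")
    if requested_output > max_output:
        raise ValueError(f"requested_output exceeds max output of {max_output}")
    return context_window - requested_output


print(max_prompt_tokens(66_000))   # prompt budget when reserving the full output
print(f"{MAX_OUTPUT / CONTEXT_WINDOW:.0%} of context")
```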

Benchmarks

MMLU-Pro: Not evaluated
GPQA Diamond: 85.8%
HLE: 22.2%
LiveCodeBench: Not evaluated
SciCode: 39.5%
TerminalBench Hard: 32.6%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: Not evaluated
IFBench: 75.6%
Long Context Recall: 67.3%
Tau2: 93.9%
apache-2.0 · 27B · GGUF / GPTQ / AWQ
Downloads

2.6M

Likes

996

VRAM (FP16)

24-48 GB

GPU

A6000 / M3 Ultra
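
As a rough cross-check on the hardware figures, weight storage can be estimated as parameter count times bytes per parameter. This is a rule of thumb, not the page's method: it covers weights only and ignores KV cache, activations, and runtime overhead. Under it, 27B parameters at 16 bits come to about 54 GB, above the listed 24-48 GB range, so that range may correspond to quantized formats such as the GGUF / GPTQ / AWQ variants mentioned above. `weight_memory_gb` is a hypothetical helper.

```python
def weight_memory_gb(params_billion: float, bits_per_param: int) -> float:
    """Approximate weight storage in GB (10^9 bytes).

    Weights only: excludes KV cache, activations, and framework overhead.
    """
    return params_billion * bits_per_param / 8


for label, bits in [("FP16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gb(27, bits):.1f} GB")
```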
