Qwen: Qwen3.6 35B A3B

Alibaba·Released 2026-04-27

Open Source35B262K ctxApache 2.0Multimodal

About

Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architecture combining Gated...

Quality Index

31.6

93rd of 537

Top 17%

Coding Index

41.9

67th of 447

Top 15%

Price/1M

$0.35

282nd cheapest

35% below median

Top 42%

Speed

147 tok/s

Top 27%

TTFT

1.36s

Context Window

262K

110th largest

Top 38%

Market Position

Qwen: Qwen3.6 35B A3BMarket Average

Pricing

Input

$0.14

per 1M tokens

Output

$1.00

per 1M tokens

Blended

$0.35

per 1M tokens

Cheaper than 58% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.35

Monthly

$10.65

vs. Similar Models

Qwen: Qwen3 Max ThinkingQ:+0.1

$1.56+339%

MiniMax: MiniMax M2.1Q:-0.2

$0.45+28%

Qwen3.5 397B A17B (Non-reasoning)Q:+0.4

$1.35+280%

DeepSeek V4 Pro (Non-reasoning)Q:-0.4

$0.54+53%

Performance

147

tokens/sec

Faster than 73% of models

1.36

seconds

Faster than 38% of models

38.03

seconds

Faster than 12% of models

Market Median

94 tok/s

56% faster

Median TTFT

1.11s

23% slower

Throughput/Dollar

414

tok/s per $/1M

Speed Comparison

GPT-3.5 Turbo

147 tok/s-0%

GLM-4.7-Flash (Non-reasoning)

148 tok/s+0%

Mistral Small 3

146 tok/s-1%

Context Window

262K

tokens

Larger than 62% of models

Max Output

262K

tokens

100% of context

Benchmarks

MMLU-ProNot evaluated

GPQA Diamond

84.1%

HLE

20.2%

LiveCodeBenchNot evaluated

SciCode

35.8%

TerminalBench Hard

34.8%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025Not evaluated

IFBench

64.4%

Long Context Recall

63.7%

Tau2

95.3%

Market AverageTop Score

Open Source

View model repository

apache-2.035BGGUF / GPTQ / AWQ

Downloads

5.6M

Likes

2.3K

VRAM (FP16)

48-80 GB

GPU

A100 80GB

Quick Compare

Similar Models

Qwen: Qwen3 Max Thinking

Alibaba

Q: 31.7$1.56/1M262K ctx

Slower: 70%Pricier: 339%

MiniMax: MiniMax M2.1

MiniMax

Q: 31.4$0.45/1M205K ctx

Faster: 53%Pricier: 28%

Qwen3.5 397B A17B (Non-reasoning)

Alibaba

Q: 32.0$1.35/1M

Slower: 64%Pricier: 280%

GPT-5 (low)

OpenAI

Q: 31.2$3.44/1M

Slower: 46%Pricier: 868%

MiMo-V2-Flash (Reasoning)

Xiaomi

Q: 31.2$0.15/1M

Slower: 38%Cheaper: 58%

DeepSeek V4 Pro (Non-reasoning)

DeepSeek

Q: 31.2$0.54/1M

Slower: 43%Pricier: 53%

Compare all 7 models

Qwen: Qwen3.6 35B A3B

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Open Source

Quick Compare

Similar Models

Market Position