inclusionAI: Ling-2.6-flash

inclusionAI · Released 2026-04-21

262K ctx

About

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Quality Index

19.3

204th of 537

Top 38%

Coding Index

23.2

201st of 447

Top 45%

Price/1M

$0.01

56th cheapest

97% below median

Top 8%

Speed

194 tok/s

Top 14%

TTFT

0.94s

Context Window

262K

110th largest

Top 38%

Market Position

[Chart: inclusionAI: Ling-2.6-flash vs. Market Average]

Pricing

Input

$0.01

per 1M tokens

Output

$0.03

per 1M tokens

Blended

$0.01

per 1M tokens

Cheaper than 92% of models. Median price is $0.54/1M tokens.
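The blended figure displays as $0.01, yet the $0.45 monthly cost at 1M tokens per day and the percentage gaps to similar models are consistent with an unrounded blended price near $0.015. A minimal sketch of how such a figure could arise, assuming a 3:1 input-to-output weighted average; the page does not state its weighting, and `blended_price` is a hypothetical helper:

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price in USD per 1M tokens.

    The default 3:1 input:output weighting is an assumption, not something
    the listing states.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_per_m + output_weight * output_per_m) / total_weight


# Listed prices: $0.01 input and $0.03 output per 1M tokens.
price = blended_price(0.01, 0.03)
print(f"${price:.3f} per 1M tokens")
```

A workload with a different input/output mix would shift the blend, so the weights are worth adjusting to match actual traffic.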

Cost Calculator

Tokens per day: 1M (range: 100K to 100M)

Daily

$0.01

Monthly

$0.45
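The calculator figures can be approximated with simple arithmetic. This is a sketch under two assumptions that the page does not confirm: an unrounded blended price of $0.015 per 1M tokens (the displayed $0.01 appears to be a rounded value) and a 30-day month. The function names are illustrative.

```python
BLENDED_PRICE_PER_M = 0.015  # assumption: unrounded blended price, USD per 1M tokens
DAYS_PER_MONTH = 30          # assumption: the calculator uses a 30-day month


def daily_cost(tokens_per_day: int, price_per_m: float = BLENDED_PRICE_PER_M) -> float:
    """Cost in USD of processing `tokens_per_day` tokens in one day."""
    return tokens_per_day / 1_000_000 * price_per_m


def monthly_cost(tokens_per_day: int, price_per_m: float = BLENDED_PRICE_PER_M,
                 days: int = DAYS_PER_MONTH) -> float:
    """Cost in USD over `days` days at a constant daily token volume."""
    return daily_cost(tokens_per_day, price_per_m) * days


# Slider default (1M tokens/day) and slider maximum (100M tokens/day).
for volume in (1_000_000, 100_000_000):
    print(volume, round(daily_cost(volume), 3), round(monthly_cost(volume), 2))
```

At the default of 1M tokens per day this gives a monthly cost of $0.45, matching the figure shown.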

vs. Similar Models

MoonshotAI: Kimi K2 0711 · Q: +0.1 · $1.00 · +6583%

OpenAI: GPT-4.1 · Q: +0.1 · $3.50 · +23233%

Qwen3 Max (Preview) · Q: -0.1 · $2.40 · +15900%

GLM-4.5 (Reasoning) · Q: +0.2 · $1.00 · +6567%
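The percentage gaps above appear to be relative price differences against Ling-2.6-flash. A minimal sketch, assuming a baseline blended price of $0.015 per 1M tokens (inferred from the listed numbers, not stated on the page); `percent_pricier` is a hypothetical helper:

```python
def percent_pricier(other_price_per_m: float, base_price_per_m: float = 0.015) -> float:
    """How much more expensive `other_price_per_m` is than the baseline, in percent.

    The 0.015 default is an assumed unrounded blended price for Ling-2.6-flash.
    """
    return (other_price_per_m / base_price_per_m - 1) * 100


# Displayed blended prices for two of the listed models (USD per 1M tokens).
for name, price in [("OpenAI: GPT-4.1", 3.50), ("Qwen3 Max (Preview)", 2.40)]:
    print(f"{name}: +{round(percent_pricier(price))}%")
```

The two models displayed at $1.00 show slightly different gaps (+6583% and +6567%), which suggests the page computes the gap from unrounded prices rather than the displayed ones.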

Performance

194 tokens/sec · Faster than 86% of models

0.94 seconds · Faster than 58% of models

0.94 seconds · Faster than 71% of models

Market Median · 94 tok/s · 106% faster

Median TTFT · 1.11s · 15% faster
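Both "faster" percentages can be reproduced from the listed values, assuming throughput is compared as a ratio to the median and TTFT as a relative reduction from the median. The formulas are inferred from the numbers, and the function names are illustrative:

```python
def percent_faster_throughput(model_tps: float, median_tps: float) -> float:
    """Relative throughput gain over the median, in percent (higher is better)."""
    return (model_tps / median_tps - 1) * 100


def percent_faster_latency(model_seconds: float, median_seconds: float) -> float:
    """Relative latency reduction versus the median, in percent (lower latency is better)."""
    return (1 - model_seconds / median_seconds) * 100


# Listed values: 194 tok/s vs. a 94 tok/s median; 0.94s TTFT vs. a 1.11s median.
print(round(percent_faster_throughput(194, 94)))
print(round(percent_faster_latency(0.94, 1.11)))
```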

Throughput/Dollar

12933

tok/s per $/1M
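The throughput-per-dollar figure is consistent with dividing output speed by the blended price. A minimal sketch, assuming an unrounded blended price of $0.015 per 1M tokens (an inference from the listed numbers, not a value the page states):

```python
def throughput_per_dollar(tokens_per_second: float, price_per_m: float) -> float:
    """Output speed (tok/s) divided by price (USD per 1M tokens)."""
    return tokens_per_second / price_per_m


# 194 tok/s at an assumed blended price of $0.015 per 1M tokens.
print(round(throughput_per_dollar(194, 0.015)))
```

Because the price sits in the denominator, very cheap models score high on this metric even at moderate speeds, so it is best read alongside the raw tok/s figure.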

Speed Comparison

Grok 4.20 0309 (Non-reasoning) · 195 tok/s · +1%

Qwen3.5 35B A3B (Non-reasoning) · 193 tok/s · -1%

Qwen: Qwen3.7 Max · 196 tok/s · +1%

Context Window

262K

tokens

Larger than 62% of models

Max Output

33K

tokens

13% of context

Benchmarks

MMLU-Pro: Not evaluated

GPQA Diamond: 59.3%

HLE: 6.2%

LiveCodeBench: Not evaluated

SciCode: 27.1%

TerminalBench Hard: 21.2%

MATH-500: Not evaluated

AIME: Not evaluated

AIME 2025: Not evaluated

IFBench: 57.4%

Long Context Recall: 25.0%

Tau2: 86.0%


Quick Compare

Similar Models

MoonshotAI: Kimi K2 0711

Kimi

Q: 19.4 · $1.00/1M · 131K ctx

Slower: 86% · Pricier: 6583%

OpenAI: GPT-4.1

OpenAI

Q: 19.4 · $3.50/1M · 1.0M ctx

Slower: 31% · Pricier: 23233%

Devstral 2

Mistral

Q: 19.2 · Price: N/A

Slower: 77% · Coding: +8.1

Qwen3 Max (Preview)

Alibaba

Q: 19.2 · $2.40/1M

Slower: 72% · Pricier: 15900%

GLM-4.5 (Reasoning)

Z AI

Q: 19.5 · $1.00/1M

Slower: 75% · Pricier: 6567%

OpenAI: o3 Mini

OpenAI

Q: 19.0 · $1.93/1M · 200K ctx

Faster: 18% · Pricier: 12733%
