Skip to main content
Back to Explore

inclusionAI: Ling-2.6-flash

InclusionAI·Released 2026-04-21
262K ctx

About

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Pricing

Input

$0.01

per 1M tokens

Output

$0.03

per 1M tokens

Blended

$0.01

per 1M tokens

Cheaper than 92% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.01

Monthly

$0.45

vs. Similar Models

MoonshotAI: Kimi K2 0711Q:+0.1
$1.00+6583%
OpenAI: GPT-4.1Q:+0.1
$3.50+23233%
Qwen3 Max (Preview)Q:-0.1
$2.40+15900%
GLM-4.5 (Reasoning)Q:+0.2
$1.00+6567%

Performance

194

tokens/sec

Faster than 86% of models

0.94

seconds

Faster than 58% of models

0.94

seconds

Faster than 71% of models

Market Median

94 tok/s

106% faster

Median TTFT

1.11s

15% faster

Throughput/Dollar

12933

tok/s per $/1M

Speed Comparison

Grok 4.20 0309 (Non-reasoning)
195 tok/s+1%
Qwen3.5 35B A3B (Non-reasoning)
193 tok/s-1%
Qwen: Qwen3.7 Max
196 tok/s+1%

Context Window

262K

tokens

Larger than 62% of models

Max Output

33K

tokens

13% of context

Benchmarks

MMLU-ProNot evaluated
GPQA Diamond
59.3%
HLE
6.2%
LiveCodeBenchNot evaluated
SciCode
27.1%
TerminalBench Hard
21.2%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
57.4%
Long Context Recall
25.0%
Tau2
86.0%
Market AverageTop Score

Quick Compare

Similar Models

Compare all 7 models