About
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Related Models
Pricing
Input
$0.01
per 1M tokens
Output
$0.03
per 1M tokens
Blended
$0.01
per 1M tokens
Cheaper than 92% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.01
Monthly
$0.45
vs. Similar Models
Performance
194
tokens/sec
Faster than 86% of models
0.94
seconds
Faster than 58% of models
0.94
seconds
Faster than 71% of models
Market Median
94 tok/s
106% faster
Median TTFT
1.11s
15% faster
Throughput/Dollar
12933
tok/s per $/1M
Speed Comparison
Context Window
262K
tokens
Larger than 62% of models
Max Output
33K
tokens
13% of context