Google: Gemini 2.5 Flash Lite Preview 09-2025
About
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Related Models
Pricing
Input
$0.10
per 1M tokens
Output
$0.40
per 1M tokens
Blended
$0.17
per 1M tokens
Cheaper than 72% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.17
Monthly
$5.25
vs. Similar Models
Performance
353
tokens/sec
Faster than 98% of models
0.43
seconds
Faster than 92% of models
0.43
seconds
Faster than 95% of models
Market Median
94 tok/s
275% faster
Median TTFT
1.11s
61% faster
Throughput/Dollar
2020
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 90% of models
Max Output
66K
tokens
6% of context