Loading...
Loading...
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.
Quality Index
33.5
78th of 444
Top 18%
Coding Index
30.1
82nd of 354
Top 23%
Price/1M
$0.56
405th cheapest
88% above median
Top 59%
Speed
220 tok/s
Top 5%
TTFT
8.23s
Context Window
1.0M
8th largest
Top 6%
Input
$0.25
per 1M tokens
Output
$1.50
per 1M tokens
Blended
$0.56
per 1M tokens
Cheaper than 41% of models. Median price is $0.30/1M tokens.
Daily
$0.56
Monthly
$16.89
220
tokens/sec
Faster than 95% of models
8.23
seconds
Faster than 8% of models
8.23
seconds
Faster than 23% of models
Market Median
45 tok/s
385% faster
Median TTFT
0.42s
1870% slower
Throughput/Dollar
390
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 94% of models
Max Output
66K
tokens
6% of context