About
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Related Models
Google: Gemini 3.5 Flash2026-05-19Gemini 3.5 Flash (medium)2026-05-19Gemini 3.5 Flash (minimal)2026-05-19Google: Gemini 3.1 Flash Lite Preview2026-03-03Google: Gemini 3.1 Pro Preview Custom Tools2026-02-25Google: Gemini 3.1 Pro Preview2026-02-19Gemini 3 Deep Think2026-02-05Gemini 3 Flash Preview (Reasoning)2025-12-17
Pricing
Input
$0.25
per 1M tokens
Output
$1.50
per 1M tokens
Blended
$0.56
per 1M tokens
Cheaper than 49% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.56
Monthly
$16.88
vs. Similar Models
Google: Gemini 3.1 Flash Lite Preview
$0.560%
Qwen: Qwen3.7 Plus
$0.56-0%
Llama 3.1 Instruct 70B
$0.56-0%
Meta: Llama 3 70B Instruct
$0.57+1%
Performance
Context Window
1.0M
tokens
Larger than 90% of models
Max Output
66K
tokens
6% of context
Context Window Comparison
Z.ai: GLM 5.2
1.0MSame
Google: Gemini 3.5 Flash
1.0MSame
Google: Gemini 3.1 Pro Preview
1.0MSame