Google: Gemini 3.1 Flash Lite

Google · Released 2026-05-07
1.0M context · Multimodal

About

Gemini 3.1 Flash Lite is Google’s generally available (GA), high-efficiency multimodal model, optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Pricing

Input: $0.25 per 1M tokens

Output: $1.50 per 1M tokens

Blended: $0.56 per 1M tokens
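
The listing does not say how the blended figure is derived. One weighting that reproduces it is a 3:1 input-to-output token ratio; that ratio is an assumption, not something the page states. A minimal sketch under that assumption (the function name `blended_price` is hypothetical):

```python
def blended_price(input_price, output_price, input_share=0.75):
    """Weighted average price per 1M tokens.

    input_share is the assumed fraction of tokens that are input;
    0.75 corresponds to a 3:1 input-to-output ratio (an assumption).
    """
    return input_share * input_price + (1 - input_share) * output_price


# Listed prices: $0.25 input, $1.50 output, per 1M tokens.
price = blended_price(0.25, 1.50)
print(f"${price:.4f} per 1M tokens")
```

With the listed prices this gives 0.75 × 0.25 + 0.25 × 1.50 = 0.5625, which rounds to the listed $0.56. A workload with a different input/output mix would land on a different blended rate, so pass your own `input_share` when estimating.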

Cheaper than 49% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day: 1M (slider range: 100K to 100M)

Daily: $0.56

Monthly: $16.88
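
The calculator figures appear consistent with multiplying an unrounded blended rate of $0.5625 per 1M tokens by the daily volume, then by a 30-day month. Both the unrounded rate and the 30-day month are assumptions inferred from the displayed numbers, not stated on the page. A rough sketch (the names `estimate_cost`, `BLENDED_PER_MILLION`, and `DAYS_PER_MONTH` are hypothetical):

```python
BLENDED_PER_MILLION = 0.5625  # assumed unrounded blended rate (listed as $0.56)
DAYS_PER_MONTH = 30           # assumed month length


def estimate_cost(tokens_per_day,
                  price_per_million=BLENDED_PER_MILLION,
                  days=DAYS_PER_MONTH):
    """Return (daily, monthly) cost in dollars for a given daily token volume."""
    daily = tokens_per_day / 1_000_000 * price_per_million
    return daily, daily * days


daily, monthly = estimate_cost(1_000_000)
print(f"daily=${daily:.4f} monthly=${monthly:.3f}")
```

At 1M tokens per day this yields $0.5625 daily and $16.875 monthly, in line with the listed $0.56 and $16.88. Using the rounded $0.56 rate instead would give $16.80 per month, which is why the sketch assumes the unrounded value.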

vs. Similar Models

Google: Gemini 3.1 Flash Lite Preview
$0.56 (0%)
Qwen: Qwen3.7 Plus
$0.56 (-0%)
Llama 3.1 Instruct 70B
$0.56 (-0%)
Meta: Llama 3 70B Instruct
$0.57 (+1%)

Performance

Context Window: 1.0M tokens (larger than 90% of models)

Max Output: 66K tokens (6% of context)

Context Window Comparison

Z.ai: GLM 5.2
1.0M (Same)
Google: Gemini 3.5 Flash
1.0M (Same)
Google: Gemini 3.1 Pro Preview
1.0M (Same)
