Skip to main content
Back to Explore

Google: Gemini 3.1 Flash Lite Preview

Google·Released 2026-03-03
1.0M ctxMultimodal

About

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

Pricing

Input

$0.25

per 1M tokens

Output

$1.50

per 1M tokens

Blended

$0.56

per 1M tokens

Cheaper than 49% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.56

Monthly

$16.88

vs. Similar Models

Qwen: Qwen3.5-9BQ:0.0
$0.11-80%
Qwen3 Max Thinking (Preview)Q:0.0
$2.40+327%
GLM-4.6 (Reasoning)Q:+0.1
$0.96+71%
Gemma 4 31B (Non-reasoning)Q:-0.2
$0.20-64%

Performance

329

tokens/sec

Faster than 96% of models

4.92

seconds

Faster than 17% of models

4.92

seconds

Faster than 47% of models

Market Median

94 tok/s

248% faster

Median TTFT

1.11s

342% slower

Throughput/Dollar

584

tok/s per $/1M

Speed Comparison

LFM2 2.6B
335 tok/s+2%
gpt-oss-120b (low)
340 tok/s+4%
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
347 tok/s+5%

Context Window

1.0M

tokens

Larger than 90% of models

Max Output

66K

tokens

6% of context

Benchmarks

MMLU-ProNot evaluated
GPQA Diamond
82.2%
HLE
16.2%
LiveCodeBenchNot evaluated
SciCode
41.9%
TerminalBench Hard
24.2%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
77.2%
Long Context Recall
65.3%
Tau2
31.3%
Market AverageTop Score

Quick Compare

Similar Models

Compare all 7 models