Google: Gemini 3.1 Flash Lite Preview

Google·Released 2026-03-03

1.0M ctxMultimodal

About

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

Quality Index

25.0

147th of 537

Top 28%

Coding Index

34.7

112th of 447

Top 25%

Price/1M

$0.56

348th cheapest

3% above median

Top 51%

Speed

329 tok/s

Top 4%

TTFT

4.92s

Context Window

1.0M

17th largest

Top 10%

Market Position

Google: Gemini 3.1 Flash Lite PreviewMarket Average

Pricing

Input

$0.25

per 1M tokens

Output

$1.50

per 1M tokens

Blended

$0.56

per 1M tokens

Cheaper than 49% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.56

Monthly

$16.88

vs. Similar Models

Qwen: Qwen3.5-9BQ:0.0

$0.11-80%

Qwen3 Max Thinking (Preview)Q:0.0

$2.40+327%

GLM-4.6 (Reasoning)Q:+0.1

$0.96+71%

Gemma 4 31B (Non-reasoning)Q:-0.2

$0.20-64%

Performance

329

tokens/sec

Faster than 96% of models

4.92

seconds

Faster than 17% of models

4.92

seconds

Faster than 47% of models

Market Median

94 tok/s

248% faster

Median TTFT

1.11s

342% slower

Throughput/Dollar

584

tok/s per $/1M

Speed Comparison

LFM2 2.6B

335 tok/s+2%

gpt-oss-120b (low)

340 tok/s+4%

Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)

347 tok/s+5%

Context Window

1.0M

tokens

Larger than 90% of models

Max Output

66K

tokens

6% of context

Benchmarks

MMLU-ProNot evaluated

GPQA Diamond

82.2%

HLE

16.2%

LiveCodeBenchNot evaluated

SciCode

41.9%

TerminalBench Hard

24.2%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025Not evaluated

IFBench

77.2%

Long Context Recall

65.3%

Tau2

31.3%

Market AverageTop Score

Quick Compare

Similar Models

Qwen: Qwen3.5-9B

Alibaba

Q: 25.0$0.11/1M262K ctx

Slower: 79%Cheaper: 80%

Qwen3 Max Thinking (Preview)

Alibaba

Q: 25.0$2.40/1M

Slower: 84%Pricier: 327%

GLM-4.6 (Reasoning)

Z AI

Q: 25.1$0.96/1M

Slower: 83%Pricier: 71%

Gemma 4 31B (Non-reasoning)

Google

Q: 24.8$0.20/1M

Slower: 83%Cheaper: 64%

Grok 4.3 (Non-reasoning)

xAI

Q: 24.8$1.56/1M

Slower: 64%Pricier: 178%

Inception: Mercury 2

Inception

Q: 25.3$0.38/1M128K ctx

Faster: 220%Cheaper: 33%

Compare all 7 models

Google: Gemini 3.1 Flash Lite Preview

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Quick Compare

Similar Models

Market Position