Google: Gemini 2.5 Flash Lite Preview 09-2025

Google·Released 2025-09-25

1.0M ctxMoEMultimodal

About

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Quality Index

13.1

277th of 537

Top 52%

Coding Index

14.5

282nd of 447

Top 64%

Math Index

46.7

145th of 269

Top 55%

Price/1M

$0.17

178th cheapest

68% below median

Top 28%

Speed

353 tok/s

Top 2%

TTFT

0.43s

Context Window

1.0M

17th largest

Top 10%

Market Position

Google: Gemini 2.5 Flash Lite Preview 09-2025Market Average

Pricing

Input

$0.10

per 1M tokens

Output

$0.40

per 1M tokens

Blended

$0.17

per 1M tokens

Cheaper than 72% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.17

Monthly

$5.25

vs. Similar Models

Gemma 4 12B (Non-reasoning)Q:+0.1

$0.15-14%

Qwen3 VL 30B A3B (Reasoning)Q:+0.2

$0.34+93%

QwQ 32BQ:+0.3

$0.74+326%

Qwen3 235B A22B (Reasoning)Q:+0.3

$2.63+1400%

Performance

353

tokens/sec

Faster than 98% of models

0.43

seconds

Faster than 92% of models

0.43

seconds

Faster than 95% of models

Market Median

94 tok/s

275% faster

Median TTFT

1.11s

61% faster

Throughput/Dollar

2020

tok/s per $/1M

Speed Comparison

Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)

347 tok/s-2%

Granite 3.3 8B (Non-reasoning)

362 tok/s+2%

gpt-oss-120b (low)

340 tok/s-4%

Context Window

1.0M

tokens

Larger than 90% of models

Max Output

66K

tokens

6% of context

Benchmarks

MMLU-Pro

79.6%

GPQA Diamond

65.1%

HLE

4.6%

LiveCodeBench

64.1%

SciCode

28.5%

TerminalBench Hard

7.6%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025

46.7%

IFBench

41.8%

Long Context Recall

48.0%

Tau2

30.4%

Market AverageTop Score

Quick Compare

Similar Models

Devstral Small 2

Mistral

Q: 13.1N/A/1M

Slower: 85%Coding: +14.8

Gemma 4 12B (Non-reasoning)

Google

Q: 13.2$0.15/1M

Slower: 55%Cheaper: 14%

Gemini 2.0 Flash Thinking Experimental (Jan '25)

Google

Q: 13.3N/A/1M

Coding: +9.6

Qwen3 VL 30B A3B (Reasoning)

Alibaba

Q: 13.3$0.34/1M

Slower: 65%Pricier: 93%

Motif-2-12.7B-Reasoning

Motif Technologies

Q: 12.8N/A/1M

Ling-1T

InclusionAI

Q: 12.8N/A/1M

Coding: +4.3

Compare all 7 models

Used by Agents

Gobii

Google: Gemini 2.5 Flash Lite Preview 09-2025

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Quick Compare

Similar Models

Used by Agents

Market Position