Skip to main content
Back to Explore

Gemini 2.5 Flash-Lite (Reasoning)

Google·Released 2025-06-17
MoEMultimodal

Pricing

Input

$0.10

per 1M tokens

Output

$0.40

per 1M tokens

Blended

$0.17

per 1M tokens

Cheaper than 72% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.17

Monthly

$5.25

vs. Similar Models

OpenAI: GPT-4oQ:-0.2
$4.38+2400%
Qwen3 VL 32B InstructQ:-0.3
$0.18+4%
Ministral 3 14BQ:-0.3
$0.20+14%
Claude 3 OpusQ:+0.4
$30.00+17043%

Performance

270

tokens/sec

Faster than 95% of models

21.61

seconds

Faster than 5% of models

21.61

seconds

Faster than 23% of models

Market Median

94 tok/s

188% faster

Median TTFT

1.10s

1855% slower

Throughput/Dollar

1543

tok/s per $/1M

Speed Comparison

gpt-oss-20B (low)
264 tok/s-2%
Nova Micro
262 tok/s-3%
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
292 tok/s+8%

Benchmarks

MMLU-Pro
75.9%
GPQA Diamond
62.5%
HLE
6.4%
LiveCodeBench
59.3%
SciCode
19.3%
TerminalBench Hard
4.5%
MATH-500
96.9%
AIME
70.3%
AIME 2025
53.3%
IFBench
49.9%
Long Context Recall
51.3%
Tau2
18.4%
Market AverageTop Score

Quick Compare

Similar Models

Compare all 7 models