Google: Gemini 2.5 Flash

Google·Released 2025-06-17

1.0M ctxMoEMultimodal

About

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Quality Index

14.1

264th of 537

Top 50%

Coding Index

17.8

247th of 447

Top 55%

Math Index

60.3

117th of 269

Top 43%

Price/1M

$0.85

402nd cheapest

56% above median

Top 61%

Speed

212 tok/s

Top 11%

TTFT

0.43s

Context Window

1.0M

17th largest

Top 10%

Market Position

Google: Gemini 2.5 FlashMarket Average

Pricing

Input

$0.30

per 1M tokens

Output

$2.50

per 1M tokens

Blended

$0.85

per 1M tokens

Cheaper than 39% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.85

Monthly

$25.50

vs. Similar Models

Upstage: Solar Pro 3Q:0.0

$0.26-69%

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)Q:+0.1

$0.10-89%

Qwen3 VL 235B A22B InstructQ:+0.2

$0.37-56%

GPT-5 mini (minimal)Q:+0.2

$0.69-19%

Performance

212

tokens/sec

Faster than 89% of models

0.43

seconds

Faster than 93% of models

0.43

seconds

Faster than 95% of models

Market Median

94 tok/s

125% faster

Median TTFT

1.11s

62% faster

Throughput/Dollar

250

tok/s per $/1M

Speed Comparison

Arcee AI: Trinity Large Thinking

211 tok/s-0%

Gemini 3.5 Flash (medium)

211 tok/s-1%

Google: Gemini 2.5 Flash Lite

211 tok/s-1%

Context Window

1.0M

tokens

Larger than 90% of models

Max Output

66K

tokens

6% of context

Benchmarks

MMLU-Pro

80.9%

GPQA Diamond

68.3%

HLE

5.1%

LiveCodeBench

49.5%

SciCode

29.1%

TerminalBench Hard

12.1%

MATH-500

93.2%

AIME

50.0%

AIME 2025

60.3%

IFBench

39.0%

Long Context Recall

45.9%

Tau2

14.9%

Market AverageTop Score

Quick Compare

Similar Models

Upstage: Solar Pro 3

Upstage

Q: 14.1$0.26/1M128K ctx

Cheaper: 69%Context Window: 8x smaller

ZAYA1-8B

Zyphra

Q: 14.1N/A/1M

Coding: -12.2

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)

NVIDIA

Q: 14.2$0.10/1M

Slower: 68%Cheaper: 89%

K2-V2 (high)

MBZUAI

Q: 14.2N/A/1M

o1-mini

OpenAI

Q: 14.0N/A/1M

Qwen: Qwen3 VL 235B A22B Instruct

Alibaba

Q: 14.3$0.37/1M262K ctx

Slower: 77%Cheaper: 56%

Compare all 7 models

Used by Agents

Slate Agent View all 7 agents →

Google: Gemini 2.5 Flash

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Quick Compare

Similar Models

Used by Agents

Market Position