Related Models
Google: Gemini 3.5 Flash2026-05-19Gemini 3.5 Flash (medium)2026-05-19Gemini 3.5 Flash (minimal)2026-05-19Google: Gemini 3.1 Flash Lite2026-05-07Google: Gemini 3.1 Flash Lite Preview2026-03-03Google: Gemini 3.1 Pro Preview Custom Tools2026-02-25Google: Gemini 3.1 Pro Preview2026-02-19Gemini 3 Deep Think2026-02-05
Pricing
Input
$2.00
per 1M tokens
Output
$12.00
per 1M tokens
Blended
$4.50
per 1M tokens
Cheaper than 12% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$4.50
Monthly
$135.00
vs. Similar Models
Qwen: Qwen3.6 PlusQ:0.0
$0.73-84%
Z.ai: GLM 5Q:-0.1
$0.93-79%
Grok Build 0.1 0616Q:+0.2
$1.25-72%
OpenAI: GPT-5.4 MiniQ:+0.4
$1.69-63%
Performance
161
tokens/sec
Faster than 78% of models
32.20
seconds
Faster than 3% of models
32.20
seconds
Faster than 16% of models
Market Median
95 tok/s
71% faster
Median TTFT
1.11s
2809% slower
Throughput/Dollar
36
tok/s per $/1M
Speed Comparison
GPT-5.4 nano (Non-Reasoning)
162 tok/s+0%
Gemma 4 12B (Reasoning)
161 tok/s-0%
GPT-5.4 mini (Non-Reasoning)
162 tok/s+1%
Benchmarks
MMLU-Pro
89.8%
GPQA Diamond
90.8%
HLE
37.2%
LiveCodeBench
91.7%
SciCode
56.1%
TerminalBench Hard
41.7%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
95.7%
IFBench
70.4%
Long Context Recall
70.7%
Tau2
87.1%
Market AverageTop Score