Related Models
Google: Gemini 3.5 Flash - 2026-05-19
Gemini 3.5 Flash (medium) - 2026-05-19
Gemini 3.5 Flash (minimal) - 2026-05-19
Google: Gemini 3.1 Flash Lite - 2026-05-07
Google: Gemini 3.1 Flash Lite Preview - 2026-03-03
Google: Gemini 3.1 Pro Preview Custom Tools - 2026-02-25
Google: Gemini 3.1 Pro Preview - 2026-02-19
Gemini 3 Deep Think - 2026-02-05
Pricing
Input: $0.10 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.17 per 1M tokens
Cheaper than 72% of models. Median price is $0.54/1M tokens.
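The page does not say how the blended price is derived. As a minimal sketch, assuming a 3:1 input-to-output token weighting (an assumption, not stated in the source), the input and output prices above give about $0.175 per 1M tokens, which is consistent with the displayed $0.17. The function name `blended_price` and its parameters are hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption; the source page does not
    state which ratio it uses.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Prices per 1M tokens taken from the Pricing section.
blended = blended_price(0.10, 0.40)
print(f"Blended: ${blended:.3f} per 1M tokens")
```

A workload with a different input/output mix can pass its own weights, for example `blended_price(0.10, 0.40, input_weight=1, output_weight=1)` for an even split.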
Cost Calculator
Tokens per day: 1M (range 100K to 100M)
Daily: $0.17
Monthly: $5.25
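The calculator figures can be approximated with simple arithmetic. This sketch assumes an unrounded blended price of $0.175 per 1M tokens and a 30-day month; both are inferred from the displayed numbers, not stated on the page. The function name `estimate_cost` is hypothetical.

```python
def estimate_cost(tokens_per_day: float, price_per_million: float,
                  days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily_cost, monthly_cost) in dollars.

    The 30-day month is an assumption inferred from the displayed figures.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    monthly = daily * days_per_month
    return daily, monthly


# 1M tokens per day at an assumed unrounded blended price of $0.175 per 1M.
daily, monthly = estimate_cost(1_000_000, 0.175)
print(f"Daily: ${daily:.3f}, Monthly: ${monthly:.2f}")
```

Under these assumptions the monthly figure works out to $5.25, matching the value shown, while the daily figure of $0.175 appears on the page as $0.17.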
vs. Similar Models
OpenAI: GPT-4o, Q: -0.2, $4.38 (+2400%)
Qwen3 VL 32B Instruct, Q: -0.3, $0.18 (+4%)
Ministral 3 14B, Q: -0.3, $0.20 (+14%)
Claude 3 Opus, Q: +0.4, $30.00 (+17043%)
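The percentages in this comparison look like relative price differences against this model's blended price. A minimal sketch, assuming the baseline is an unrounded $0.175 per 1M tokens (an inference, not stated on the page); the helper `percent_more_expensive` is hypothetical.

```python
def percent_more_expensive(other_price: float, base_price: float) -> float:
    """Relative price difference of another model versus the baseline, in percent."""
    return (other_price / base_price - 1) * 100


BASE_PRICE = 0.175  # assumed unrounded blended price per 1M tokens

# Blended prices per 1M tokens as displayed in the comparison.
similar_models = {
    "OpenAI: GPT-4o": 4.38,
    "Qwen3 VL 32B Instruct": 0.18,
    "Ministral 3 14B": 0.20,
    "Claude 3 Opus": 30.00,
}

for name, price in similar_models.items():
    print(f"{name}: {percent_more_expensive(price, BASE_PRICE):+.0f}%")
```

Because the displayed prices are rounded, the computed values can differ slightly from the page for some rows, for example GPT-4o and Qwen3 VL 32B Instruct.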
Performance
270 tokens/sec (faster than 95% of models)
21.61 seconds (faster than 5% of models)
21.61 seconds (faster than 23% of models)
Market Median: 94 tok/s (188% faster)
Median TTFT: 1.10s (1855% slower)
Throughput/Dollar: 1543 tok/s per $/1M
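The derived performance figures can be reproduced approximately from the raw numbers. This sketch assumes Throughput/Dollar is output speed divided by an unrounded blended price of $0.175 per 1M tokens, and that "faster" means the relative difference against the market median; both are inferences from the displayed values. The function names are hypothetical.

```python
def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Output speed divided by blended price (tok/s per $/1M)."""
    return tokens_per_sec / price_per_million


def percent_faster(tokens_per_sec: float, median_tokens_per_sec: float) -> float:
    """Relative speed difference versus the market median, in percent."""
    return (tokens_per_sec / median_tokens_per_sec - 1) * 100


# 270 tok/s and the 94 tok/s median come from the Performance section;
# 0.175 is an assumed unrounded blended price.
print(f"Throughput/Dollar: {throughput_per_dollar(270, 0.175):.0f} tok/s per $/1M")
print(f"Versus median: {percent_faster(270, 94):.0f}% faster")
```

With the rounded inputs shown on the page, the speed comparison comes out about one percentage point below the displayed 188%, which suggests the page computes from unrounded measurements.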
Speed Comparison
gpt-oss-20B (low): 264 tok/s (-2%)
Nova Micro: 262 tok/s (-3%)
NVIDIA Nemotron Nano 12B v2 VL (Reasoning): 292 tok/s (+8%)
Benchmarks
MMLU-Pro: 75.9%
GPQA Diamond: 62.5%
HLE: 6.4%
LiveCodeBench: 59.3%
SciCode: 19.3%
TerminalBench Hard: 4.5%
MATH-500: 96.9%
AIME: 70.3%
AIME 2025: 53.3%
IFBench: 49.9%
Long Context Recall: 51.3%
Tau2: 18.4%