Related Models
Google: Gemini 3.5 Flash2026-05-19Gemini 3.5 Flash (medium)2026-05-19Gemini 3.5 Flash (minimal)2026-05-19Google: Gemini 3.1 Flash Lite2026-05-07Google: Gemini 3.1 Flash Lite Preview2026-03-03Google: Gemini 3.1 Pro Preview Custom Tools2026-02-25Google: Gemini 3.1 Pro Preview2026-02-19Gemini 3 Deep Think2026-02-05
Pricing
Input
$2.00
per 1M tokens
Output
$12.00
per 1M tokens
Blended
$4.50
per 1M tokens
Cheaper than 12% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$4.50
Monthly
$135.00
vs. Similar Models
MiMo-V2-Flash (Feb 2026)Q:+0.1
$0.15-97%
OpenAI: GPT-5 MiniQ:-0.1
$0.69-85%
Grok 4Q:+0.2
$11.00+144%
DeepSeek V3.2 (Reasoning)Q:+0.3
$0.34-93%
Performance
120
tokens/sec
Faster than 62% of models
3.62
seconds
Faster than 19% of models
3.62
seconds
Faster than 48% of models
Market Median
94 tok/s
27% faster
Median TTFT
1.11s
225% slower
Throughput/Dollar
27
tok/s per $/1M
Speed Comparison
Qwen: Qwen3 VL 30B A3B Instruct
120 tok/s-0%
Grok 4.3 (Non-reasoning)
119 tok/s-1%
Z.ai: GLM 4.7
119 tok/s-1%
Benchmarks
MMLU-Pro
89.5%
GPQA Diamond
88.7%
HLE
27.6%
LiveCodeBench
85.7%
SciCode
49.9%
TerminalBench Hard
34.1%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
86.7%
IFBench
49.7%
Long Context Recall
67.3%
Tau2
68.1%
Market AverageTop Score