Loading...
Loading...
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window. Reasoning Details must be preserved when using multi-turn tool calling, see our docs here: https://openrouter.ai/docs/use-cases/reasoning-tokens#preserving-reasoning. The 3.1 update introduces measurable gains in SWE benchmarks and real-world coding environments, along with stronger autonomous task execution in structured domains such as finance and spreadsheet-based workflows. Designed for advanced development and agentic systems, Gemini 3.1 Pro Preview improves long-horizon stability and tool orchestration while increasing token efficiency. It introduces a new medium thinking level to better balance cost, speed, and performance. The model excels in agentic coding, structured planning, multimodal analysis, and workflow automation, making it well-suited for autonomous agents, financial modeling, spreadsheet automation, and high-context enterprise tasks.
Quality Index
57.2
1st of 444
Top 0%
Coding Index
55.5
2nd of 354
Top 1%
Price/1M
$4.50
615th cheapest
1400% above median
Top 91%
Speed
114 tok/s
Top 24%
TTFT
23.00s
Context Window
1.0M
8th largest
Top 6%
Input
$2.00
per 1M tokens
Output
$12.00
per 1M tokens
Blended
$4.50
per 1M tokens
Cheaper than 9% of models. Median price is $0.30/1M tokens.
Daily
$4.50
Monthly
$135.00
114
tokens/sec
Faster than 76% of models
23.00
seconds
Faster than 4% of models
23.00
seconds
Faster than 12% of models
Market Median
45 tok/s
151% faster
Median TTFT
0.42s
5402% slower
Throughput/Dollar
25
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 94% of models
Max Output
66K
tokens
6% of context