Loading...
Loading...
Olmo 3.1 32B Think is a large-scale, 32-billion-parameter model designed for deep reasoning, complex multi-step logic, and advanced instruction following. Building on the Olmo 3 series, version 3.1 delivers refined reasoning behavior and stronger performance across demanding evaluations and nuanced conversational tasks. Developed by Ai2 under the Apache 2.0 license, Olmo 3.1 32B Think continues the Olmo initiative’s commitment to openness, providing full transparency across model weights, code, and training methodology.
Input
$0.15
per 1M tokens
Output
$0.50
per 1M tokens
Blended
$0.24
per 1M tokens
Cheaper than 68% of models. Median price is $0.56/1M tokens.
Daily
$0.24
Monthly
$7.13
98
tokens/sec
Faster than 56% of models
0.44
seconds
Faster than 89% of models
20.91
seconds
Faster than 24% of models
Market Median
86 tok/s
14% faster
Median TTFT
1.07s
59% faster
Throughput/Dollar
411
tok/s per $/1M
Speed Comparison
Context Window
66K
tokens
Larger than 17% of models
Max Output
66K
tokens
100% of context
Quality Index
13.9
352nd of 507
Top 70%
Coding Index
9.8
326th of 417
Top 79%
Math Index
77.3
75th of 269
Top 28%
Price/1M
$0.24
203rd cheapest
58% below median
Top 32%
Speed
98 tok/s
Top 44%
TTFT
0.44s
Context Window
66K
331st largest
Top 83%