About
Olmo 3.1 32B Think is a large-scale, 32-billion-parameter model designed for deep reasoning, complex multi-step logic, and advanced instruction following. Building on the Olmo 3 series, version 3.1 delivers refined reasoning behavior and stronger performance across demanding evaluations and nuanced conversational tasks. Developed by Ai2 under the Apache 2.0 license, Olmo 3.1 32B Think continues the Olmo initiative’s commitment to openness, providing full transparency across model weights, code, and training methodology.
Pricing
Input: $0.15 per 1M tokens
Output: $0.50 per 1M tokens
Blended: $0.24 per 1M tokens
Cheaper than 67% of models. Median price is $0.54/1M tokens.
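The blended figure is consistent with a 3:1 input-to-output token weighting, although the listing does not state which ratio it uses. The sketch below reproduces the arithmetic under that assumption; `blended_price` and its weighting parameters are hypothetical names, not part of any published API.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 input:output weighting is an assumption;
    the listing does not say how its blended price is derived.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices: $0.15 input, $0.50 output (per 1M tokens).
price = blended_price(0.15, 0.50)
print(f"blended: ${price:.4f} per 1M tokens")
```

Under this weighting the result is about $0.2375, which is in line with the listed $0.24 once rounded to cents.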
Cost Calculator
Daily: $0.24
Monthly: $7.13
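The calculator's inputs are not shown on the page. The daily and monthly figures are, however, consistent with roughly 1M tokens per day split 3:1 between input and output, over a 30-day month. The sketch below works through that case; the token volumes, the 30-day month, and the `usage_cost` helper are all assumptions made for illustration.

```python
def usage_cost(input_tokens: int, output_tokens: int,
               input_price: float = 0.15, output_price: float = 0.50) -> float:
    """Cost in USD for a given token volume; prices are per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Assumed workload: 750K input + 250K output tokens per day (not from the listing).
daily = usage_cost(750_000, 250_000)
monthly = daily * 30  # assumed 30-day month

print(f"daily:   ${daily:.4f}")
print(f"monthly: ${monthly:.4f}")
```

With these assumptions the daily cost is about $0.2375 and the monthly cost about $7.125, matching the listed values up to rounding to cents.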
Performance
98 tokens/sec (faster than 52% of models)
0.44 seconds to first token (faster than 93% of models)
20.91 seconds (faster than 24% of models)
Market Median: 94 tok/s (this model is 4% faster)
Median TTFT: 1.10s (this model is 60% faster)
Throughput/Dollar: 411 tok/s per $/1M
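The comparison figures above can be checked with a few lines of arithmetic. This is a sketch under two assumptions: that the 0.44-second figure is the time to first token (TTFT) compared against the 1.10s median, and that throughput per dollar divides tokens/sec by the blended price per 1M tokens. The function names are hypothetical.

```python
def percent_higher_throughput(model_tps: float, median_tps: float) -> float:
    """How much higher the model's throughput is than the median, in percent."""
    return (model_tps / median_tps - 1.0) * 100.0


def percent_lower_latency(model_seconds: float, median_seconds: float) -> float:
    """How much lower the model's latency is than the median, in percent."""
    return (1.0 - model_seconds / median_seconds) * 100.0


def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens/sec divided by price per 1M tokens (assumed definition)."""
    return tokens_per_sec / price_per_million


speedup = percent_higher_throughput(98, 94)
ttft_gain = percent_lower_latency(0.44, 1.10)
value = throughput_per_dollar(98, 0.24)

print(f"throughput vs. median: {speedup:.1f}% higher")
print(f"TTFT vs. median:       {ttft_gain:.1f}% lower")
print(f"throughput per dollar: {value:.1f}")
```

The first two results agree with the listed 4% and 60%. The third comes out near 408 rather than 411 when computed from the rounded figures shown on the page, so the listing presumably works from unrounded throughput and price values.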
Context Window: 66K tokens (larger than 13% of models)
Max Output: 66K tokens (100% of context)
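Because the output limit equals the full context window, the practical output budget for a request is likely bounded by whatever the prompt leaves free. The sketch below illustrates that bookkeeping. It assumes that prompt and output tokens share one window and that "66K" means 65,536 tokens; neither is confirmed by the listing, and `available_output_tokens` is a hypothetical helper.

```python
CONTEXT_WINDOW = 65_536  # assumption: "66K" taken to mean 65,536 tokens
MAX_OUTPUT = 65_536      # assumption: listed as 100% of context


def available_output_tokens(prompt_tokens: int,
                            context_window: int = CONTEXT_WINDOW,
                            max_output: int = MAX_OUTPUT) -> int:
    """Output tokens left after the prompt, assuming a shared window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    # Never negative, and never above the model's output cap.
    return max(0, min(max_output, remaining))


budget = available_output_tokens(10_000)
print(f"output budget after a 10,000-token prompt: {budget}")
```

For a reasoning model, the thinking tokens typically count toward the output budget as well, so long prompts leave less room for both the reasoning trace and the final answer.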