Loading...
Loading...
Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this...
Input
$0.20
per 1M tokens
Output
$0.60
per 1M tokens
Blended
$0.30
per 1M tokens
Cheaper than 62% of models. Median price is $0.56/1M tokens.
Daily
$0.30
Monthly
$9.00
54
tokens/sec
Faster than 25% of models
0.29
seconds
Faster than 96% of models
0.29
seconds
Faster than 97% of models
Market Median
86 tok/s
37% slower
Median TTFT
1.07s
73% faster
Throughput/Dollar
181
tok/s per $/1M
Speed Comparison
Context Window
66K
tokens
Larger than 17% of models
Max Output
16K
tokens
25% of context
Quality Index
12.2
389th of 507
Top 77%
Coding Index
5.6
366th of 417
Top 88%
Price/1M
$0.30
235th cheapest
46% below median
Top 38%
Speed
54 tok/s
Top 75%
TTFT
0.29s
Context Window
66K
331st largest
Top 83%