About
Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this...
Pricing
Input: $0.20 per 1M tokens
Output: $0.60 per 1M tokens
Blended: $0.30 per 1M tokens
Cheaper than 62% of models. Median price is $0.54/1M tokens.
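The blended figure is consistent with weighting input and output prices 3:1, a common convention on pricing pages that this page does not state. A minimal sketch under that assumption (the function name and the default ratio are illustrative, not taken from the page):

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption; the page does not state it.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices: $0.20 input, $0.60 output (per 1M tokens).
price = blended_price(0.20, 0.60)
print(f"${price:.2f} per 1M tokens")
```

A workload with a different input/output mix can pass its own ratio, for example `blended_price(0.20, 0.60, input_parts=1, output_parts=1)` for an even split.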
Cost Calculator
Daily: $0.30
Monthly: $9.00
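The calculator's inputs are not shown, but the two figures are consistent with 1M tokens per day at the blended price over a 30-day month. A small sketch under those assumptions (the volume, the month length, and the function name are all hypothetical):

```python
def usage_cost(tokens_per_day: int, price_per_million: float,
               days: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in dollars.

    Assumes a flat blended price and a 30-day month; both are
    illustrative defaults, not values stated on the page.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    return daily, daily * days


# Assumed volume: 1M tokens per day at the $0.30 blended price.
daily, monthly = usage_cost(1_000_000, 0.30)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")
```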
Performance
54 tokens/sec (faster than 24% of models)
0.29 seconds (faster than 97% of models)
0.29 seconds (faster than 99% of models)
Market Median: 94 tok/s (this model is 42% slower)
Median TTFT: 1.10s (this model is 74% faster)
Throughput/Dollar: 181 tok/s per $/1M
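The derived figures in this section can be approximated from the listed values. The sketch below assumes throughput per dollar is throughput divided by the blended price, and that the "slower" and "faster" percentages are relative differences from the market median; the page confirms neither formula, and the helper names are hypothetical. With the rounded inputs shown, the ratio comes out near 180 rather than 181, which suggests the page computes from unrounded measurements.

```python
def throughput_per_dollar(tokens_per_sec: float, blended_price: float) -> float:
    """Tokens/sec per dollar of blended price per 1M tokens (assumed formula)."""
    return tokens_per_sec / blended_price


def percent_vs_median(value: float, median: float) -> float:
    """Signed relative difference from the median, in percent."""
    return (value - median) / median * 100


tpd = throughput_per_dollar(54, 0.30)
speed_gap = percent_vs_median(54, 94)      # negative: below median throughput
ttft_gap = percent_vs_median(0.29, 1.10)   # negative: shorter time to first token

print(f"Throughput/Dollar: {tpd:.0f} tok/s per $/1M")
print(f"Throughput vs. median: {speed_gap:.1f}%")
print(f"TTFT vs. median: {ttft_gap:.1f}%")
```

A negative throughput gap means slower than the median, while a negative TTFT gap means a faster first token, so the sign has to be read per metric.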
Context Window
66K tokens (larger than 13% of models)
Max Output: 16K tokens (25% of context)
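The 25% figure matches exactly if the rounded "66K" and "16K" stand for 65,536 and 16,384 tokens. That reading is an assumption; the page shows only the rounded values. A short check under that assumption:

```python
# Assumed exact sizes behind the rounded figures; not stated on the page.
CONTEXT_WINDOW = 65_536   # displayed as "66K"
MAX_OUTPUT = 16_384       # displayed as "16K"

output_share = MAX_OUTPUT / CONTEXT_WINDOW * 100
# Room left for the prompt if a response uses the full output budget.
max_prompt_tokens = CONTEXT_WINDOW - MAX_OUTPUT

print(f"Max output is {output_share:.0f}% of context")
print(f"Prompt budget at full output: {max_prompt_tokens} tokens")
```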