Olmo 3.1 32B Instruct

Allen AI · Released 2026-01-13

Open Source · 66K ctx

About

Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this...

Quality Index

6.5

419th of 537

Top 78%

Coding Index

5.6

395th of 447

Top 89%

Price/1M

$0.30

255th cheapest

45% below median

Top 38%

Speed

54 tok/s

Top 76%

TTFT

0.29s

Context Window

66K

376th largest

Top 87%

Market Position

[Chart: Olmo 3.1 32B Instruct vs. Market Average]

Pricing

Input

$0.20

per 1M tokens

Output

$0.60

per 1M tokens

Blended

$0.30

per 1M tokens

Cheaper than 62% of models. Median price is $0.54/1M tokens.
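The blended figure above is consistent with weighting the input and output prices 3:1. The page does not state its formula, so that ratio is an assumption, and `blended_price` is a hypothetical helper name. A minimal sketch of the arithmetic:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption, not stated on this page.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


price = blended_price(0.20, 0.60)
print(f"${price:.2f} per 1M tokens")  # → $0.30 per 1M tokens
```

Workloads that generate more output than this ratio assumes will pay an effective rate closer to the $0.60 output price.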

Cost Calculator

Tokens per day: 1M

Range: 100K to 100M

Daily

$0.30

Monthly

$9.00
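The calculator figures can be reproduced from the blended price. The sketch below assumes a 30-day month, which matches the $9.00 monthly figure but is not stated on the page; `estimate_cost` is a hypothetical helper name.

```python
BLENDED_PRICE_PER_1M = 0.30  # USD per 1M tokens, from the Pricing section


def estimate_cost(tokens_per_day: int, days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in USD.

    The 30-day month is an assumption, not stated on this page.
    """
    daily = tokens_per_day / 1_000_000 * BLENDED_PRICE_PER_1M
    return daily, daily * days_per_month


daily, monthly = estimate_cost(1_000_000)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")  # → Daily: $0.30, Monthly: $9.00
```

At the upper end of the calculator range (100M tokens per day), the same arithmetic gives about $30 per day.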

vs. Similar Models

Sarvam 30B (high) · Q: +0.1

$0.05 (-84%)

AllenAI: Olmo 3 32B Think · Q: -0.1

$0.24 (-21%)

Mistral: Saba · Q: -0.1

$0.30 (0%)

IBM: Granite 4.1 8B · Q: +0.2

$0.06 (-79%)

Performance

54 tokens/sec

Faster than 24% of models

0.29

seconds

Faster than 97% of models

0.29

seconds

Faster than 99% of models

Market Median

94 tok/s

42% slower

Median TTFT

1.10s

74% faster

Throughput/Dollar

181

tok/s per $/1M
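The relative figures in this section follow from simple ratios of the rounded values shown on the page. The sketch below uses those rounded inputs, so results can differ by a point from the displayed numbers (for example 180 versus 181), presumably because the page computes from unrounded measurements. The function names are hypothetical.

```python
def throughput_per_dollar(tokens_per_sec: float, price_per_1m: float) -> float:
    """Output speed divided by blended price (tok/s per $/1M)."""
    return tokens_per_sec / price_per_1m


def percent_below(value: float, reference: float) -> float:
    """How far `value` sits below `reference`, as a percentage of `reference`."""
    return (reference - value) / reference * 100


# Inputs are the rounded figures from this page.
print(round(throughput_per_dollar(54, 0.30)))  # roughly 180; the page shows 181
print(round(percent_below(54, 94), 1))         # speed vs. 94 tok/s median, about 42-43%
print(round(percent_below(0.29, 1.10), 1))     # TTFT vs. 1.10s median, about 74%
```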

Speed Comparison

Anthropic: Claude Opus 4.7

54 tok/s (-1%)

Gemma 3 1B Instruct

55 tok/s (+1%)

Claude Opus 4.6 (Adaptive Reasoning, Max Effort)

55 tok/s (+1%)

Context Window

66K

tokens

Larger than 13% of models

Max Output

16K

tokens

25% of context
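The 25% figure matches exactly if "66K" and "16K" are rounded displays of 65,536 and 16,384 tokens. Those exact values are an assumption, since the page only shows the rounded numbers. A small sketch of the ratio, with `output_share` as a hypothetical helper:

```python
# Assumed exact token counts behind the rounded "66K" and "16K" shown on the page.
CONTEXT_WINDOW = 65_536
MAX_OUTPUT = 16_384


def output_share(max_output: int, context_window: int) -> float:
    """Fraction of the context window that a maximum-length output would occupy."""
    return max_output / context_window


print(f"{output_share(MAX_OUTPUT, CONTEXT_WINDOW):.0%} of context")  # → 25% of context
```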

Benchmarks

MMLU-Pro

Not evaluated

GPQA Diamond

53.9%

HLE

4.9%

LiveCodeBench

Not evaluated

SciCode

16.7%

TerminalBench Hard

0.0%

MATH-500

Not evaluated

AIME

Not evaluated

AIME 2025

Not evaluated

IFBench

39.2%

Long Context Recall

0.0%

Tau2

21.3%

Open Source

Quick Compare

Similar Models

AllenAI: Olmo 3 32B Think

Allen AI

Q: 6.4 · $0.24/1M · 66K ctx

Cheaper: 21% · Coding: +4.9

Mistral: Saba

Mistral

Q: 6.4 · $0.30/1M · 33K ctx

Context Window: 2x smaller

DeepSeek R1 Distill Llama 8B

DeepSeek

Q: 6.4 · N/A/1M

Sarvam 30B (high)

Sarvam

Q: 6.6 · $0.05/1M

Faster: 349% · Cheaper: 84%

Gemini 2.0 Flash Thinking Experimental (Dec '24)

Google

Q: 6.6 · N/A/1M

DeepSeek-V2.5

DeepSeek

Q: 6.6 · N/A/1M
