Olmo 3.1 32B Instruct

Allen AI·Released 2026-01-13

Open Source66K ctx

About

Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this...

Quality Index

12.2

389th of 507

Top 77%

Coding Index

5.6

366th of 417

Top 88%

Price/1M

$0.30

235th cheapest

46% below median

Top 38%

Speed

54 tok/s

Top 75%

TTFT

0.29s

Context Window

66K

331st largest

Top 83%

Market Position

Olmo 3.1 32B InstructMarket Average

Pricing

Input

$0.20

per 1M tokens

Output

$0.60

per 1M tokens

Blended

$0.30

per 1M tokens

Cheaper than 62% of models. Median price is $0.56/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.30

Monthly

$9.00

vs. Similar Models

AllenAI: Olmo 3 32B ThinkQ:-0.1

$0.24-21%

Mistral: SabaQ:-0.1

$0.300%

Anthropic: Claude 3 HaikuQ:+0.1

$0.50+67%

Qwen: Qwen-TurboQ:-0.2

$0.06-81%

Performance

54

tokens/sec

Faster than 25% of models

0.29

seconds

Faster than 96% of models

0.29

seconds

Faster than 97% of models

Market Median

86 tok/s

37% slower

Median TTFT

1.07s

73% faster

Throughput/Dollar

181

tok/s per $/1M

Speed Comparison

Gemma 4 E4B (Non-reasoning)

54 tok/s-1%

Anthropic: Claude Sonnet 4.6

54 tok/s-1%

Qwen3.5 Omni Plus

55 tok/s+1%

Context Window

66K

tokens

Larger than 17% of models

Max Output

16K

tokens

25% of context

Benchmarks

MMLU-ProNot evaluated

GPQA Diamond

53.9%

HLE

4.9%

LiveCodeBenchNot evaluated

SciCode

16.7%

TerminalBench Hard

0.0%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025Not evaluated

IFBench

39.2%

Long Context Recall

0.0%

Tau2

21.3%

Market AverageTop Score

Open Source

Quick Compare

Similar Models

AllenAI: Olmo 3 32B Think

Allen AI

Q: 12.1$0.24/1M66K ctx

Cheaper: 21%Coding: +4.9

Mistral: Saba

Mistral

Q: 12.1$0.30/1M33K ctx

Context Window: 2x smaller

DeepSeek R1 Distill Llama 8B

DeepSeek

Q: 12.1N/A/1M

Gemma 4 E2B (Non-reasoning)

Google

Q: 12.1N/A/1M

Anthropic: Claude 3 Haiku

Anthropic

Q: 12.3$0.50/1M200K ctx

Faster: 148%Pricier: 67%

Sarvam 30B (high)

Sarvam

Q: 12.3N/A/1M

Faster: 211%

Compare all 7 models

Quality Index

12.2

389th of 507

Top 77%

Coding Index

5.6

366th of 417

Top 88%

Price/1M

$0.30

235th cheapest

46% below median

Top 38%

Speed

54 tok/s

Top 75%

TTFT

0.29s

Context Window

66K

331st largest

Top 83%

Market Position

Olmo 3.1 32B InstructMarket Average

Olmo 3.1 32B Instruct

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Open Source

Quick Compare

Similar Models

Market Position