OpenAI: gpt-oss-120b

OpenAI·Released 2025-08-05

Open Source120B131K ctxApache 2.0

About

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

Quality Index

23.8

158th of 537

Top 30%

Coding Index

30.4

145th of 447

Top 32%

Math Index

93.4

15th of 269

Top 6%

Price/1M

$0.06

77th cheapest

89% below median

Top 12%

Speed

307 tok/s

Top 4%

TTFT

0.54s

Context Window

131K

236th largest

Top 73%

Market Position

OpenAI: gpt-oss-120bMarket Average

Pricing

Input

$0.03

per 1M tokens

Output

$0.15

per 1M tokens

Blended

$0.06

per 1M tokens

Cheaper than 88% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.06

Monthly

$1.80

vs. Similar Models

Claude 4.5 Haiku (Non-reasoning)Q:-0.1

$2.00+3233%

Qwen: Qwen3 MaxQ:+0.2

$1.56+2500%

Claude 3.7 Sonnet (Non-reasoning)Q:-0.3

$6.00+9900%

MoonshotAI: Kimi K2 0905Q:-0.3

$1.07+1692%

Performance

307

tokens/sec

Faster than 96% of models

0.54

seconds

Faster than 84% of models

7.06

seconds

Faster than 44% of models

Market Median

94 tok/s

225% faster

Median TTFT

1.11s

52% faster

Throughput/Dollar

5112

tok/s per $/1M

Speed Comparison

Llama 3.1 Nemotron Instruct 70B

301 tok/s-2%

Nemotron 3 Nano Omni 30B A3B Reasoning

298 tok/s-3%

Nova Micro

289 tok/s-6%

Context Window

131K

tokens

Larger than 27% of models

Max Output

131K

tokens

100% of context

Benchmarks

MMLU-Pro

80.8%

GPQA Diamond

78.2%

HLE

18.5%

LiveCodeBench

87.8%

SciCode

38.9%

TerminalBench Hard

23.5%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025

93.4%

IFBench

69.0%

Long Context Recall

50.7%

Tau2

65.8%

Market AverageTop Score

Open Source

View model repository

apache-2.0120BGGUF / GPTQ / AWQ

Downloads

4.0M

Likes

4.9K

VRAM (FP16)

Multi-GPU

GPU

8x A100 / H100

Quick Compare

Similar Models

Gemini 2.5 Flash Preview (Sep '25) (Reasoning)

Google

Q: 23.8N/A/1M

Coding: -5.8

Claude 4.5 Haiku (Non-reasoning)

Anthropic

Q: 23.7$2.00/1M

Slower: 66%Pricier: 3233%

Qwen: Qwen3 Max

Alibaba

Q: 24.0$1.56/1M262K ctx

Slower: 81%Pricier: 2500%

MoonshotAI: Kimi K2 0905

Kimi

Q: 23.5$1.07/1M262K ctx

Slower: 91%Pricier: 1692%

Claude 3.7 Sonnet (Non-reasoning)

Anthropic

Q: 23.5$6.00/1M200K ctx

Pricier: 9900%Coding: -3.7

Qwen3.6 35B A3B (Non-reasoning)

Alibaba

Q: 24.2$0.84/1M

Slower: 51%Pricier: 1307%

Compare all 7 models

Used by Agents

Roo Code View all 9 agents →

OpenAI: gpt-oss-120b

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Open Source

Quick Compare

Similar Models

Used by Agents

Market Position