OpenAI: GPT-4.1

OpenAI·Released 2025-04-14

1.0M ctxMultimodal

About

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

Quality Index

19.4

202nd of 537

Top 38%

Coding Index

21.8

217th of 447

Top 49%

Math Index

34.7

175th of 269

Top 65%

Price/1M

$3.50

569th cheapest

544% above median

Top 85%

Speed

122 tok/s

Top 35%

TTFT

0.57s

Context Window

1.0M

46th largest

Top 11%

Market Position

OpenAI: GPT-4.1Market Average

Pricing

Input

$2.00

per 1M tokens

Output

$8.00

per 1M tokens

Blended

$3.50

per 1M tokens

Cheaper than 15% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$3.50

Monthly

$105.00

vs. Similar Models

MoonshotAI: Kimi K2 0711Q:0.0

$1.00-71%

inclusionAI: Ling-2.6-flashQ:-0.1

$0.01-100%

GLM-4.5 (Reasoning)Q:+0.1

$1.00-71%

Qwen3 Max (Preview)Q:-0.2

$2.40-31%

Performance

122

tokens/sec

Faster than 65% of models

0.57

seconds

Faster than 80% of models

0.57

seconds

Faster than 86% of models

Market Median

94 tok/s

30% faster

Median TTFT

1.10s

48% faster

Throughput/Dollar

tok/s per $/1M

Speed Comparison

GPT-4o (Aug '24)

121 tok/s-0%

Qwen: Qwen3 VL 30B A3B Instruct

121 tok/s-1%

Nex AGI: Nex-N2-Pro

121 tok/s-1%

Context Window

1.0M

tokens

Larger than 89% of models

Max Output

33K

tokens

3% of context

Benchmarks

MMLU-Pro

80.6%

GPQA Diamond

66.6%

HLE

4.6%

LiveCodeBench

45.7%

SciCode

38.1%

TerminalBench Hard

13.6%

MATH-500

91.3%

AIME

43.7%

AIME 2025

34.7%

IFBench

43.0%

Long Context Recall

61.0%

Tau2

47.1%

Market AverageTop Score

Quick Compare

Similar Models

MoonshotAI: Kimi K2 0711

Kimi

Q: 19.4$1.00/1M131K ctx

Slower: 78%Cheaper: 71%

inclusionAI: Ling-2.6-flash

InclusionAI

Q: 19.3$0.01/1M262K ctx

Faster: 61%Cheaper: 100%

GLM-4.5 (Reasoning)

Z AI

Q: 19.5$1.00/1M

Slower: 59%Cheaper: 71%

Devstral 2

Mistral

Q: 19.2N/A/1M

Slower: 67%Coding: +9.5

Qwen3 Max (Preview)

Alibaba

Q: 19.2$2.40/1M

Slower: 56%Cheaper: 31%

Nova 2.0 Pro Preview (low)

Amazon

Q: 19.6$3.44/1M

Coding: +4.1

Compare all 7 models

OpenAI: GPT-4.1

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Quick Compare

Similar Models

Market Position