Loading...
Loading...
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.
Quality Index
13.0
308th of 444
Top 70%
Coding Index
11.2
245th of 354
Top 69%
Math Index
24.0
199th of 268
Top 75%
Price/1M
$0.17
280th cheapest
42% below median
Top 42%
Speed
106 tok/s
Top 24%
TTFT
0.39s
Context Window
1.0M
23rd largest
Top 7%
Input
$0.10
per 1M tokens
Output
$0.40
per 1M tokens
Blended
$0.17
per 1M tokens
Cheaper than 58% of models. Median price is $0.30/1M tokens.
Daily
$0.17
Monthly
$5.25
106
tokens/sec
Faster than 76% of models
0.39
seconds
Faster than 53% of models
0.39
seconds
Faster than 55% of models
Market Median
45 tok/s
133% faster
Median TTFT
0.42s
7% faster
Throughput/Dollar
604
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 93% of models
Max Output
33K
tokens
3% of context