About
For tasks that demand low latency, GPT-4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...
Pricing
Input: $0.10 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.17 per 1M tokens
Cheaper than 72% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily: $0.17
Monthly: $5.25
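The listing does not say how the blended price or the calculator totals are derived. The sketch below shows one way to reproduce them. It assumes a 3:1 input-to-output token weighting, a workload of 1M tokens per day, and a 30-day month. All three are assumptions, chosen only because they match the listed figures, and the function names are hypothetical.

```python
# Listed prices in USD per 1M tokens.
INPUT_PRICE = 0.10
OUTPUT_PRICE = 0.40


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption, not stated in the listing.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def usage_cost(million_tokens, price_per_million):
    """Cost in USD for a given volume, expressed in millions of tokens."""
    return million_tokens * price_per_million


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)  # about 0.175
daily = usage_cost(1.0, blended)                    # assumes 1M tokens per day
monthly = daily * 30                                # assumes a 30-day month

print(f"blended ${blended:.3f}/1M, daily ${daily:.3f}, monthly ${monthly:.2f}")
```

Under these assumptions the blended price is about $0.175 per 1M tokens, which the listing shows as $0.17, and the monthly total is $5.25.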
Performance
182 tokens/sec (faster than 83% of models)
0.52 seconds (faster than 86% of models)
0.52 seconds (faster than 90% of models)
Market median throughput: 94 tok/s (this model is 93% faster)
Median TTFT: 1.11s (this model is 53% faster)
Throughput/Dollar: 1039 tok/s per $/1M
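The comparison figures above can be checked with a little arithmetic. The sketch below assumes that the 0.52 seconds value is the time to first token (the listing does not label it, but the Median TTFT comparison suggests so) and that throughput per dollar divides tokens/sec by a blended price of about $0.175 per 1M tokens. Both are assumptions, and the function names are hypothetical.

```python
def percent_faster_throughput(value, median):
    """How much higher a throughput is than the median, in percent."""
    return (value / median - 1) * 100


def percent_faster_latency(value, median):
    """How much lower a latency is than the median, in percent."""
    return (1 - value / median) * 100


def throughput_per_dollar(tokens_per_sec, price_per_million):
    """Tokens/sec divided by the price in USD per 1M tokens."""
    return tokens_per_sec / price_per_million


throughput_gain = percent_faster_throughput(182, 94)  # listed as 93% faster
ttft_gain = percent_faster_latency(0.52, 1.11)        # listed as 53% faster
value_ratio = throughput_per_dollar(182, 0.175)       # listed as 1039

print(f"{throughput_gain:.1f}% / {ttft_gain:.1f}% / {value_ratio:.0f}")
```

The results land within rounding of the listed values. The small gap on throughput per dollar (1040 here versus 1039 listed) is presumably because the page computes from unrounded inputs.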
Context Window: 1.0M tokens (larger than 89% of models)
Max Output: 33K tokens (3% of context)
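To show how the two limits relate, the sketch below computes the output share of the context window and the prompt budget left after reserving room for a full-length response. It uses the rounded figures from the listing and assumes that prompt and output tokens share one window, which the listing does not state. The function names are hypothetical.

```python
CONTEXT_WINDOW = 1_000_000  # tokens, rounded figure from the listing (1.0M)
MAX_OUTPUT = 33_000         # tokens, rounded figure from the listing (33K)


def output_share(max_output, context_window):
    """Maximum output as a percentage of the context window."""
    return max_output / context_window * 100


def max_prompt_tokens(context_window, reserved_output):
    """Prompt budget left after reserving space for the response.

    Assumes prompt and output tokens count against the same window.
    """
    if reserved_output > context_window:
        raise ValueError("reserved output exceeds the context window")
    return context_window - reserved_output


share = output_share(MAX_OUTPUT, CONTEXT_WINDOW)
budget = max_prompt_tokens(CONTEXT_WINDOW, MAX_OUTPUT)
print(f"output share {share:.1f}%, prompt budget {budget} tokens")
```

With the rounded inputs the share comes out at roughly 3%, in line with the listing.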