About
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...
Pricing
Input: $0.10 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.17 per 1M tokens
Cheaper than 72% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily: $0.17
Monthly: $5.25
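The blended, daily, and monthly figures above are consistent with a 3:1 input-to-output token mix, 1M tokens per day, and a 30-day month. The page states none of these three parameters; they are assumptions inferred from the numbers, and the function names are illustrative. A minimal sketch of the arithmetic under those assumptions:

```python
# Prices from the Pricing section, in USD per 1M tokens.
INPUT_PRICE = 0.10
OUTPUT_PRICE = 0.40


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens.

    The 3:1 input:output mix is an assumption, not stated on the page.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


def usage_cost(price_per_million, tokens_per_day, days=1):
    """Cost in USD for a given daily token volume over a number of days."""
    return price_per_million * tokens_per_day / 1_000_000 * days


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
daily = usage_cost(blended, tokens_per_day=1_000_000)             # assumed volume
monthly = usage_cost(blended, tokens_per_day=1_000_000, days=30)  # assumed 30-day month

print(f"blended ${blended:.3f}, daily ${daily:.3f}, monthly ${monthly:.2f}")
```

Under these assumptions the blended price works out to about $0.175 per 1M tokens, which the page displays as $0.17, and the monthly cost to about $5.25.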
Performance
159 tokens/sec (faster than 77% of models)
0.54 seconds (faster than 84% of models)
0.54 seconds (faster than 89% of models)
Market Median: 94 tok/s (70% faster)
Median TTFT: 1.10s (51% faster)
Throughput/Dollar: 911 tok/s per $/1M
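The comparison figures above can be approximately reproduced from the rounded values on the page. The formulas below are assumptions about how the page derives them (throughput as a ratio over the median, latency as a reduction relative to the median, and throughput divided by an unrounded blended price of $0.175 per 1M tokens). Because the inputs are rounded, the results land within about a point of the displayed numbers rather than matching exactly.

```python
def percent_faster_throughput(model_tps, median_tps):
    """How much higher the model's throughput is than the median, in percent."""
    return (model_tps / median_tps - 1) * 100


def percent_faster_latency(model_seconds, median_seconds):
    """How much shorter the model's latency is than the median, in percent."""
    return (1 - model_seconds / median_seconds) * 100


def throughput_per_dollar(tps, price_per_million):
    """Tokens/sec per dollar of per-1M-token price (assumed definition)."""
    return tps / price_per_million


speedup = percent_faster_throughput(159, 94)      # page shows 70%
ttft_gain = percent_faster_latency(0.54, 1.10)    # page shows 51%
value = throughput_per_dollar(159, 0.175)         # page shows 911

print(f"{speedup:.1f}% faster throughput")
print(f"{ttft_gain:.1f}% faster TTFT")
print(f"{value:.0f} tok/s per $/1M")
```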
Context Window: 1.0M tokens (larger than 89% of models)
Max Output: 33K tokens (3% of context)
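As a rough illustration of how these two limits interact, the sketch below checks whether a request fits. It uses the rounded figures as displayed (1.0M and 33K), so the exact limits may differ slightly, and it assumes that prompt and output tokens share the same context window. The function names are hypothetical.

```python
CONTEXT_WINDOW = 1_000_000  # "1.0M" as displayed on the page (rounded)
MAX_OUTPUT = 33_000         # "33K" as displayed on the page (rounded)


def output_share_percent(max_output=MAX_OUTPUT, context=CONTEXT_WINDOW):
    """Maximum output size as a percentage of the context window."""
    return max_output / context * 100


def fits(prompt_tokens, requested_output,
         max_output=MAX_OUTPUT, context=CONTEXT_WINDOW):
    """True if the request respects both limits.

    Assumes prompt and output tokens count against the same window.
    """
    within_output_cap = requested_output <= max_output
    within_context = prompt_tokens + requested_output <= context
    return within_output_cap and within_context


print(f"{output_share_percent():.1f}% of context")
print(fits(900_000, 33_000))  # long prompt with full-size output
print(fits(990_000, 33_000))  # prompt plus output exceeds the window
```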