Loading...
Loading...
Top 20 models ranked by output speed (tok/s)
| Rank | Model | Output Speed (tok/s) |
|---|---|---|
| 🥇 | Inception | 975 tok/s |
| 🥈 | IBM | 445 tok/s |
| 🥉 | 401 tok/s | |
| 4 | NVIDIA | 386 tok/s |
| 5 | 362 tok/s | |
| 6 | IBM | 330 tok/s |
| 7 | 322 tok/s | |
| 8 | Mistral | 293 tok/s |
| 9 | Amazon | 293 tok/s |
| 10 | Sarvam | 283 tok/s |
| 11 | OpenAI | 273 tok/s |
| 12 | OpenAI | 272 tok/s |
| 13 | OpenAI | 266 tok/s |
| 14 | OpenAI | 262 tok/s |
| 15 | 243 tok/s | |
| 16 | Amazon | 235 tok/s |
| 17 | OpenAI | 224 tok/s |
| 18 | 223 tok/s | |
| 19 | Alibaba | 223 tok/s |
| 20 | Alibaba | 223 tok/s |