Agentic & Tool Use Leaderboard
Models ranked by Agentic Score, based on independent benchmark evaluations.
Each row shows the model's benchmark score alongside its pricing and output speed, so you can evaluate quality-to-cost tradeoffs at a glance.
Agentic & Tool Use Leaderboard
Top 20 models ranked by agentic score
| Rank | Model | Agentic Score |
|---|---|---|
| 🥇 | Anthropic | 68.2 |
| 🥈 | Anthropic | 65.0 |
| 🥉 | OpenAI | 64.9 |
| 4 | Anthropic | 63.6 |
| 5 | OpenAI | 62.4 |
| 6 | OpenAI | 61.3 |
| 7 | OpenAI | 61.0 |
| 8 | 60.2 | |
| 9 | Z AI | 60.0 |
| 10 | 57.7 | |
| 11 | Alibaba | 56.0 |
| 12 | Anthropic | 55.1 |
| 13 | OpenAI | 52.2 |
| 14 | DeepSeek | 51.9 |
| 15 | MiniMax | 51.5 |
| 16 | Kimi | 51.4 |
| 17 | Xiaomi | 51.2 |
| 18 | Meta | 50.9 |
| 19 | Nex Agi | 50.1 |
| 20 | Kimi | 49.4 |