Loading...
Loading...
Models ranked by GPQA Diamond, based on independent benchmark evaluations.
Each row shows the model's benchmark score alongside its pricing and output speed, so you can evaluate quality-to-cost tradeoffs at a glance.
Top 20 models ranked by gpqa diamond
| Rank | Model | GPQA Diamond |
|---|---|---|
| 🥇 | 0.9 | |
| 🥈 | OpenAI | 0.9 |
| 🥉 | OpenAI | 0.9 |
| 4 | OpenAI | 0.9 |
| 5 | OpenAI | 0.9 |
| 6 | OpenAI | 0.9 |
| 7 | Anthropic | 0.9 |
| 8 | Kimi | 0.9 |
| 9 | xAI | 0.9 |
| 10 | OpenAI | 0.9 |
| 11 | 0.9 | |
| 12 | DeepSeek | 0.9 |
| 13 | OpenAI | 0.9 |
| 14 | xAI | 0.9 |
| 15 | OpenAI | 0.9 |
| 16 | 0.9 | |
| 17 | Anthropic | 0.9 |
| 18 | DeepSeek | 0.9 |
| 19 | Alibaba | 0.9 |
| 20 | Alibaba | 0.9 |