Kimi K2.6 and Grok 4.3 undercut the field below $1.60 while GPT-5.5 stays expensive at the top
Weekly LLM briefing: Kimi K2.6 hits 53.9 quality at $1.42/M tokens, Grok 4.3 delivers 133 tok/s at $1.56, and the budget tier closes in on mid-range models.
The sub-$2 tier now scores within 7 points of the best model available
Two models priced under $1.60 per million tokens now score above 53 on quality index. Kimi K2.6 (MoonshotAI) posts 53.9 at $1.42/M, and Grok 4.3 (xAI) hits 53.2 at $1.56/M with 133 tokens per second. That puts them 6-7 points behind GPT-5.5 (OpenAI), which leads at 60.2 but costs $11.25/M. The quality gap is narrowing faster than the price gap is closing.
What moved this week
The big story isn't a single launch. It's the shape of the market crystallizing into three distinct price bands with diminishing quality returns at the top.
Premium ($10+/M): GPT-5.5 holds the quality crown at 60.2 but at 66 tok/s, it's the slowest OpenAI model in the table. Claude Opus 4.7 scores 57.3 at $10.00/M with even slower throughput at 49 tok/s. You're paying 7x more than the mid-tier for roughly 5-6 points of quality.
Mid-range ($3-6/M): This is where value concentrates. Gemini 3.1 Pro Preview delivers 57.2 quality at $4.50/M and 125 tok/s. Qwen3.7 Max (Alibaba) is open source, scores 56.6 at $3.75/M, and pushes 198 tok/s. Gemini 3.5 Flash trades 2 quality points for 210 tok/s at $3.38/M.
Budget (<$2/M): Kimi K2.6 and Grok 4.3 now compete with models that cost 3-4x more. Kimi is open source, which matters for self-hosting.
| Model | Quality | Price/M | Speed | Open source |
|---|---|---|---|---|
| GPT-5.5 | 60.2 | $11.25 | 66 tok/s | No |
| Gemini 3.1 Pro | 57.2 | $4.50 | 125 tok/s | No |
| Qwen3.7 Max | 56.6 | $3.75 | 198 tok/s | Yes |
| Kimi K2.6 | 53.9 | $1.42 | 103 tok/s | Yes |
| Grok 4.3 | 53.2 | $1.56 | 133 tok/s | No |
Stay in the loop
Weekly LLM analysis delivered to your inbox. No spam.