Loading...
Loading...
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.
Índice de Qualidade
30.2
102nd de 444
Top 23%
Índice de Código
30.2
80th de 354
Top 23%
Índice de Matemática
44.3
148th de 268
Top 55%
Preço/1M
$1.00
487th mais barato
233% acima da mediana
Top 72%
Velocidade
80 tok/s
Top 33%
TTFT
2.09s
Janela de Contexto
205K
103rd maior
Top 29%
Entrada
$0.60
por 1M tokens
Saída
$2.20
por 1M tokens
Combinado
$1.00
por 1M tokens
Mais barato que 28% dos modelos. Preço mediano é $0.30/1M tokens.
Diário
$1.00
Mensal
$30.00
80
tokens/seg
Mais rápido que 67% dos modelos
2.09
segundos
Mais rápido que 14% dos modelos
2.09
segundos
Mais rápido que 29% dos modelos
Mediana do Mercado
45 tok/s
76% mais rápido
TTFT Mediano
0.42s
400% mais lento
Vazão/Dólar
80
tok/s por $/1M
Comparação de Velocidade
Janela de Contexto
205K
tokens
Maior que 71% dos modelos
Saída Máxima
205K
tokens
100% do contexto