Loading...
Loading...
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...
Input
$0.39
per 1M tokens
Output
$1.90
per 1M tokens
Blended
$0.77
per 1M tokens
Cheaper than 44% of models. Median price is $0.56/1M tokens.
Daily
$0.77
Monthly
$23.02
30
tokens/sec
Faster than 3% of models
1.23
seconds
Faster than 41% of models
1.23
seconds
Faster than 60% of models
Market Median
86 tok/s
65% slower
Median TTFT
1.07s
14% slower
Throughput/Dollar
39
tok/s per $/1M
Speed Comparison
Context Window
205K
tokens
Larger than 62% of models
Max Output
205K
tokens
100% of context
Quality Index
30.2
148th of 507
Top 29%
Coding Index
30.2
118th of 417
Top 29%
Math Index
44.3
149th of 269
Top 55%
Price/1M
$0.77
358th cheapest
37% above median
Top 56%
Speed
30 tok/s
Top 97%
TTFT
1.23s
Context Window
205K
156th largest
Top 38%