About
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...
Related Models
Pricing
Input
$0.43
per 1M tokens
Output
$1.74
per 1M tokens
Blended
$0.76
per 1M tokens
Cheaper than 44% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.76
Monthly
$22.73
vs. Similar Models
Performance
44
tokens/sec
Faster than 13% of models
1.80
seconds
Faster than 26% of models
1.80
seconds
Faster than 52% of models
Market Median
94 tok/s
53% slower
Median TTFT
1.10s
63% slower
Throughput/Dollar
58
tok/s per $/1M
Speed Comparison
Context Window
203K
tokens
Larger than 55% of models
Max Output
131K
tokens
65% of context