Loading...
Loading...
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...
Input
$0.13
per 1M tokens
Output
$0.85
per 1M tokens
Blended
$0.31
per 1M tokens
Cheaper than 61% of models. Median price is $0.56/1M tokens.
Daily
$0.31
Monthly
$9.30
67
tokens/sec
Faster than 36% of models
1.43
seconds
Faster than 34% of models
31.31
seconds
Faster than 14% of models
Market Median
86 tok/s
22% slower
Median TTFT
1.07s
34% slower
Throughput/Dollar
216
tok/s per $/1M
Speed Comparison
Context Window
131K
tokens
Larger than 33% of models
Max Output
98K
tokens
75% of context
Quality Index
23.2
215th of 507
Top 43%
Coding Index
23.8
167th of 417
Top 40%
Math Index
80.7
62nd of 269
Top 23%
Price/1M
$0.31
246th cheapest
45% below median
Top 39%
Speed
67 tok/s
Top 64%
TTFT
1.43s
Context Window
131K
201st largest
Top 67%