About
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...
Pricing
Input: $0.13 per 1M tokens
Output: $0.85 per 1M tokens
Blended: $0.31 per 1M tokens
Cheaper than 61% of models. Median price is $0.54/1M tokens.
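The blended figure is consistent with a 3:1 input-to-output token weighting. The page does not state its formula, so the ratio below is an assumption, and `blended_price` is a hypothetical helper name. A minimal sketch:

```python
# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE = 0.13
OUTPUT_PRICE = 0.85


def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 input:output weighting is an assumption;
    the listing does not say how "Blended" is computed.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


print(f"${blended_price(INPUT_PRICE, OUTPUT_PRICE):.2f} per 1M tokens")
# prints "$0.31 per 1M tokens"
```

Workloads that generate more output than this ratio assumes will land closer to the $0.85 output price, so it may be worth passing your own input and output proportions.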
Cost Calculator
Daily: $0.31
Monthly: $9.30
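The two figures match a usage of 1M tokens per day at the blended price over a 30-day month. Both the daily volume and the month length are inferred from the numbers, not stated by the page. A sketch of the same arithmetic for other volumes, using a hypothetical `usage_cost` helper:

```python
BLENDED_PRICE = 0.31  # USD per 1M tokens, from the listing


def usage_cost(tokens_per_day: int,
               price_per_million: float = BLENDED_PRICE,
               days: int = 30) -> tuple[float, float]:
    """Return (daily_cost, cost_over_days) in USD.

    Assumes a flat blended price and a 30-day month by default.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    return daily, daily * days


daily, monthly = usage_cost(1_000_000)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")
# prints "Daily: $0.31, Monthly: $9.30"
```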
Performance
Throughput: 75 tokens/sec (faster than 37% of models)
Time to first token (TTFT): 1.49 seconds (faster than 34% of models)
28.06 seconds (faster than 17% of models)
Market Median: 94 tok/s (this model is 20% slower)
Median TTFT: 1.10s (this model is 35% slower)
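Both "slower" percentages can be reproduced from the listed numbers if they are measured relative to the median. That baseline is an assumption, and the function names below are hypothetical:

```python
def percent_slower_throughput(model_tps: float, median_tps: float) -> float:
    """Throughput shortfall relative to the median (higher tok/s is better)."""
    return (median_tps - model_tps) / median_tps * 100


def percent_slower_latency(model_s: float, median_s: float) -> float:
    """Extra latency relative to the median (lower seconds is better)."""
    return (model_s - median_s) / median_s * 100


print(f"Throughput: {percent_slower_throughput(75, 94):.0f}% slower")
print(f"TTFT: {percent_slower_latency(1.49, 1.10):.0f}% slower")
# prints "Throughput: 20% slower" then "TTFT: 35% slower"
```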
Throughput/Dollar: 243 tok/s per $/1M
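This metric appears to be throughput divided by the blended price. With the rounded figures shown on the page the result is about 242 rather than 243, so the page presumably uses unrounded inputs. Both points are inferences. A sketch with a hypothetical helper:

```python
def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens/sec obtained per USD of blended price per 1M tokens.

    Assumes the metric is a plain ratio; the listing does not define it.
    """
    return tokens_per_sec / price_per_million


# Rounded inputs from the listing: 75 tok/s and $0.31 per 1M tokens.
print(f"{throughput_per_dollar(75, 0.31):.0f} tok/s per $/1M")
# prints "242 tok/s per $/1M"
```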
Context Window: 131K tokens (larger than 27% of models)
Max Output: 98K tokens (75% of context)
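When sizing requests, the two limits interact: a long prompt can leave less room for output than the maximum allows. The sketch below assumes that prompt and output tokens share the context window and uses the rounded figures from the listing (the exact limits may differ). `max_completion_tokens` is a hypothetical helper:

```python
CONTEXT_WINDOW = 131_000  # rounded from "131K" on the listing
MAX_OUTPUT = 98_000       # rounded from "98K" on the listing


def max_completion_tokens(prompt_tokens: int) -> int:
    """Largest output budget that fits both limits for a given prompt size.

    Assumes prompt and output tokens count against the same context window.
    """
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))


print(f"Output share of context: {MAX_OUTPUT / CONTEXT_WINDOW:.0%}")
print(max_completion_tokens(10_000))  # prompt is small, output cap applies
print(max_completion_tokens(50_000))  # prompt is large, context cap applies
```

Under these assumptions, any prompt longer than 33,000 tokens reduces the usable output budget below the 98K maximum.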