About
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...
Pricing
Input: $0.06 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.14 per 1M tokens
Cheaper than 77% of models. Median price is $0.54/1M tokens.
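The page does not state how the blended price is derived. The listed figures are consistent with a 3:1 input-to-output weighting, which is a common convention on model-comparison sites, so the sketch below treats that weighting as an assumption. `blended_price` is a hypothetical helper, not part of any published API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption; the page does not document it.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices per 1M tokens as listed above.
price = blended_price(input_price=0.06, output_price=0.40)
print(f"blended: ${price:.3f} per 1M tokens")
```

With the listed prices this works out to $0.145 per 1M tokens, which matches the displayed $0.14 if the site truncates to two decimals.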
Cost Calculator
Daily: $0.14
Monthly: $4.35
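The calculator's usage volume is not shown on the page. The two figures are consistent with 1M tokens per day at an unrounded blended price of $0.145 per 1M tokens over a 30-day month; all three of those inputs are assumptions inferred from the numbers, and `estimate_cost` is a hypothetical helper. A minimal sketch:

```python
def estimate_cost(tokens_per_day: float, price_per_million: float,
                  days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in dollars for a given token volume."""
    daily = tokens_per_day / 1_000_000 * price_per_million
    monthly = daily * days_per_month
    return daily, monthly


# Assumed inputs: 1M tokens/day, $0.145 blended price, 30-day month.
daily, monthly = estimate_cost(tokens_per_day=1_000_000, price_per_million=0.145)
print(f"daily: ${daily:.3f}, monthly: ${monthly:.2f}")
```

Changing `tokens_per_day` scales both figures linearly, so 10M tokens per day would cost ten times as much under the same assumptions.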
Performance
105 tokens/sec (faster than 56% of models)
0.92 seconds (faster than 60% of models)
20.05 seconds (faster than 25% of models)
Market Median: 94 tok/s (this model is 11% faster)
Median TTFT: 1.11s (this model is 17% faster)
Throughput/Dollar: 721 tok/s per $/1M
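The relative figures can be approximated from the raw numbers. The sketch below assumes that the 0.92-second value is the time to first token (TTFT), that each gap is measured relative to the median, and that Throughput/Dollar divides tokens per second by the blended price; none of this is stated on the page, and the function names are hypothetical.

```python
def percent_faster_throughput(model_tps: float, median_tps: float) -> float:
    """Higher is better: gain over the median, as a percentage of the median."""
    return (model_tps - median_tps) / median_tps * 100


def percent_faster_latency(model_seconds: float, median_seconds: float) -> float:
    """Lower is better: time saved versus the median, as a percentage of the median."""
    return (median_seconds - model_seconds) / median_seconds * 100


def throughput_per_dollar(tokens_per_second: float, price_per_million: float) -> float:
    """Tokens per second for each dollar of price per 1M tokens."""
    return tokens_per_second / price_per_million


throughput_gain = percent_faster_throughput(105, 94)
ttft_gain = percent_faster_latency(0.92, 1.11)
value = throughput_per_dollar(105, 0.145)  # 0.145 is the assumed unrounded blended price
print(f"throughput: {throughput_gain:.1f}% faster, TTFT: {ttft_gain:.1f}% faster")
print(f"throughput per dollar: {value:.0f}")
```

Under these assumptions the gaps come out near 11.7% and 17.1%, in line with the listed 11% and 17%. The throughput-per-dollar result is about 724 rather than the listed 721, presumably because the site works from unrounded inputs.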
Context Window
203K tokens (larger than 55% of models)
Max Output: 16K tokens (8% of context)
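The "8% of context" figure is the output cap divided by the context window. The sketch below reproduces that ratio and also shows a prompt budget, under two assumptions not confirmed by the page: that "K" means 1,000 tokens, and that reserved output tokens count against the same context window. Both helper names are hypothetical.

```python
CONTEXT_WINDOW = 203_000  # "203K" as listed; the exact token count is an assumption
MAX_OUTPUT = 16_000       # "16K" as listed; the exact token count is an assumption


def output_share(max_output: int, context_window: int) -> float:
    """Output cap as a percentage of the context window."""
    return max_output / context_window * 100


def max_prompt_tokens(context_window: int, reserved_output: int) -> int:
    """Tokens left for the prompt if output shares the context window (assumed)."""
    if reserved_output > context_window:
        raise ValueError("reserved output exceeds the context window")
    return context_window - reserved_output


share = output_share(MAX_OUTPUT, CONTEXT_WINDOW)
budget = max_prompt_tokens(CONTEXT_WINDOW, MAX_OUTPUT)
print(f"output share: {share:.1f}% of context, prompt budget: {budget} tokens")
```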