GLM-4.7-Flash is a 30B-class state-of-the-art model that balances performance and efficiency. It is further optimized for agentic coding, with stronger coding capabilities, long-horizon task planning, and tool use, and it leads open-source models of the same size on several current public benchmark leaderboards.
Quality Index: 30.1 (103rd of 444, top 23%)
Coding Index: 25.9 (100th of 354, top 29%)
Price/1M: $0.15 (275th cheapest, 49% below median, top 41%)
Speed: 86 tok/s (top 30%)
TTFT: 0.69s
Context Window: 203K (105th largest, top 30%)
Input: $0.07 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.15 per 1M tokens
Cheaper than 59% of models. Median price is $0.30/1M tokens.
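The blended figure is consistent with a weighted average of the input and output prices. The following is a minimal sketch, assuming a 3:1 input-to-output token weighting (an assumption; the page does not state how the blend is computed). The helper names `blended_price` and `percent_below` are hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption, not something the source states.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


def percent_below(price: float, median: float) -> float:
    """How far `price` sits below `median`, as a percentage of the median."""
    return (median - price) / median * 100.0


if __name__ == "__main__":
    blended = blended_price(0.07, 0.40)
    print(f"blended: ${blended:.4f} per 1M tokens")
    print(f"below median: {percent_below(0.15, 0.30):.0f}%")
```

With the listed prices this gives roughly $0.1525, which rounds to the displayed $0.15. The rounded figures put the price about 50% below the $0.30 median rather than the displayed 49%; the page presumably computes from unrounded values.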
Daily: $0.15
Monthly: $4.56
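The page does not say what usage the daily and monthly estimates assume. The figures are consistent with 1M tokens per day at the blended price and a month of about 30.4 days; both are assumptions inferred from the arithmetic, not stated by the source. A minimal sketch with a hypothetical helper `usage_cost`:

```python
DAYS_PER_MONTH = 30.4  # assumption: average month length, roughly 365 / 12


def usage_cost(tokens_per_day_millions: float, price_per_million: float,
               days_per_month: float = DAYS_PER_MONTH) -> tuple[float, float]:
    """Return (daily_cost, monthly_cost) in dollars for a steady daily volume."""
    daily = tokens_per_day_millions * price_per_million
    monthly = daily * days_per_month
    return daily, monthly


if __name__ == "__main__":
    # Assumed workload: 1M tokens per day at the $0.15 blended price.
    daily, monthly = usage_cost(1.0, 0.15)
    print(f"daily: ${daily:.2f}, monthly: ${monthly:.2f}")
```

Changing `tokens_per_day_millions` scales both figures linearly, so the same helper can estimate cost for a different workload.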
Speed: 86 tokens/sec (faster than 70% of models)
TTFT: 0.69 seconds (faster than 37% of models)
23.88 seconds (faster than 12% of models)
Market Median: 45 tok/s (this model is 90% faster)
Median TTFT: 0.42s (this model is 64% slower)
Throughput/Dollar: 567 tok/s per $/1M
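The comparisons against the market medians and the throughput-per-dollar figure can be approximated from the rounded numbers on this page. This is a minimal sketch; the function names are hypothetical, and the formulas are the obvious ratios rather than a confirmed description of how the page computes its values.

```python
def percent_difference(value: float, baseline: float) -> float:
    """Relative difference of `value` against `baseline`, in percent.

    Positive means `value` is larger than the baseline.
    """
    return (value / baseline - 1.0) * 100.0


def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Output speed divided by blended price (tok/s per $/1M tokens)."""
    return tokens_per_sec / price_per_million


if __name__ == "__main__":
    # Speed: higher is better, so a positive difference means faster.
    print(f"speed vs median: {percent_difference(86, 45):+.0f}%")
    # TTFT: lower is better, so a positive difference means slower.
    print(f"TTFT vs median: {percent_difference(0.69, 0.42):+.0f}%")
    print(f"throughput/dollar: {throughput_per_dollar(86, 0.15):.0f}")
```

From the rounded inputs this yields about 91% faster, 64% slower, and roughly 573 tok/s per $/1M. The displayed 90% and 567 differ slightly, presumably because the page works from unrounded speed and price values.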
Speed Comparison
Context Window: 203K tokens (larger than 70% of models)