Z.ai: GLM 4.5 Air

Z AI·Released 2025-07-25

Open Source131K ctx

About

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

Quality Index

16.5

238th of 537

Top 45%

Coding Index

23.8

197th of 447

Top 44%

Math Index

80.7

62nd of 269

Top 23%

Price/1M

$0.31

264th cheapest

43% below median

Top 39%

Speed

75 tok/s

Top 63%

TTFT

1.49s

Context Window

131K

236th largest

Top 73%

Market Position

Z.ai: GLM 4.5 AirMarket Average

Pricing

Input

$0.13

per 1M tokens

Output

$0.85

per 1M tokens

Blended

$0.31

per 1M tokens

Cheaper than 61% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.31

Monthly

$9.30

vs. Similar Models

Grok 4 Fast (Non-reasoning)Q:0.0

$0.28-11%

GPT-5.4 mini (Non-Reasoning)Q:+0.1

$1.69+445%

Nova 2.0 Omni (low)Q:+0.1

$0.85+174%

OpenAI: GPT-4.1 MiniQ:-0.2

$0.70+126%

Performance

tokens/sec

Faster than 37% of models

1.49

seconds

Faster than 34% of models

28.06

seconds

Faster than 17% of models

Market Median

94 tok/s

20% slower

Median TTFT

1.10s

35% slower

Throughput/Dollar

243

tok/s per $/1M

Speed Comparison

NVIDIA Nemotron Nano 9B V2 (Reasoning)

75 tok/s+0%

MiniMax: MiniMax M3

75 tok/s+0%

DeepSeek: DeepSeek V4 Pro

75 tok/s+0%

Context Window

131K

tokens

Larger than 27% of models

Max Output

98K

tokens

75% of context

Benchmarks

MMLU-Pro

81.5%

GPQA Diamond

73.3%

HLE

6.8%

LiveCodeBench

68.4%

SciCode

30.6%

TerminalBench Hard

20.5%

MATH-500

96.5%

AIME

67.3%

AIME 2025

80.7%

IFBench

37.6%

Long Context Recall

43.7%

Tau2

46.5%

Market AverageTop Score

Open Source

Quick Compare

Similar Models

Grok 4 Fast (Non-reasoning)

xAI

Q: 16.5$0.28/1M2.0M ctx

Faster: 30%Cheaper: 11%

GPT-5.4 mini (Non-Reasoning)

OpenAI

Q: 16.6$1.69/1M

Faster: 115%Pricier: 445%

Nova 2.0 Omni (low)

Amazon

Q: 16.6$0.85/1M

Pricier: 174%Coding: -9.9

Mi:dm K 2.5 Pro

Korea Telecom

Q: 16.4N/A/1M

Coding: -11.2

OpenAI: GPT-4.1 Mini

OpenAI

Q: 16.3$0.70/1M1.0M ctx

Pricier: 126%Coding: -5.3

K-EXAONE (Non-reasoning)

LG AI Research

Q: 16.7N/A/1M

Coding: -10.3

Compare all 7 models

Used by Agents

PostQode

Qwen Code

Z.ai: GLM 4.5 Air

About

Related Models

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Open Source

Quick Compare

Similar Models

Used by Agents

Market Position