Z.ai: GLM 4.5V

Z AI · Released 2025-08-11

Open Source · 66K ctx · Multimodal

About

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

Quality Index

12.7

374th of 507

Top 74%

Coding Index

10.8

312th of 417

Top 75%

Math Index

15.3

217th of 269

Top 81%

Price/1M

$0.90

394th cheapest

61% above median

Top 63%

Speed

48 tok/s

Top 82%

TTFT

37.52s

Context Window

66K

331st largest

Top 83%

Market Position

Chart: Z.ai: GLM 4.5V vs. Market Average

Pricing

Input

$0.60

per 1M tokens

Output

$1.80

per 1M tokens

Blended

$0.90

per 1M tokens

Cheaper than 37% of models. Median price is $0.56/1M tokens.
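
The listing does not say how the blended figure is derived. As a rough sketch, a 3:1 input-to-output weighting (an assumption, not something stated on the page) reproduces the $0.90 blended price and the "61% above median" figure. The function names here are illustrative only.

```python
# Prices from the listing, in USD per 1M tokens.
INPUT_PRICE = 0.60
OUTPUT_PRICE = 1.80
MEDIAN_PRICE = 0.56


def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    The 3:1 default weighting is an assumption; the listing does not state it.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


def percent_above(value: float, baseline: float) -> float:
    """How far `value` sits above `baseline`, as a percentage."""
    return (value / baseline - 1.0) * 100.0


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended: ${blended:.2f} per 1M tokens")
print(f"Above median: {percent_above(blended, MEDIAN_PRICE):.0f}%")
```

A workload with a different input/output mix would see a different effective price, so it may be worth passing your own weights rather than relying on the default.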

Cost Calculator

Tokens per day: 1M (slider range: 100K to 100M)

Daily

$0.90

Monthly

$27.00
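
The calculator figures above can be approximated with a few lines of arithmetic. This is a minimal sketch that assumes the blended price applies to every token and that a month is 30 days (the value that reproduces $27.00); neither assumption is stated in the listing.

```python
BLENDED_PRICE_PER_MILLION = 0.90  # USD per 1M tokens, from the listing
DAYS_PER_MONTH = 30               # assumption: reproduces the $27.00 monthly figure


def daily_cost(tokens_per_day: int,
               price_per_million: float = BLENDED_PRICE_PER_MILLION) -> float:
    """Cost in USD of processing `tokens_per_day` tokens at a flat blended price."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(tokens_per_day: int,
                 price_per_million: float = BLENDED_PRICE_PER_MILLION,
                 days: int = DAYS_PER_MONTH) -> float:
    """Daily cost scaled to a month of `days` days."""
    return daily_cost(tokens_per_day, price_per_million) * days


# The slider endpoints and the default value shown in the listing.
for tokens in (100_000, 1_000_000, 100_000_000):
    print(f"{tokens:>11,} tokens/day: "
          f"${daily_cost(tokens):,.2f} daily, ${monthly_cost(tokens):,.2f} monthly")
```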

vs. Similar Models

Google: Gemini 2.5 Flash Lite (Q: 0.0)

$0.17 (-81%)

Mistral Small 3 (Q: 0.0)

$0.10 (-88%)

Nova Lite (Q: 0.0)

$0.10 (-88%)

Hermes 4 - Llama-3.1 70B (Non-reasoning) (Q: -0.1)

$0.20 (-78%)

Performance

48

tokens/sec

Faster than 18% of models

37.52

seconds

Faster than 3% of models

37.52

seconds

Faster than 12% of models

Market Median

86 tok/s

44% slower

Median TTFT

1.07s

3400% slower

Throughput/Dollar

53

tok/s per $/1M
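
The derived performance figures in this section appear to follow from the raw numbers. The sketch below recomputes them under the assumption that throughput per dollar is output speed divided by the blended price, and that the "slower" percentages are relative gaps to the market median. The listing does not document its formulas, so treat these as plausible reconstructions.

```python
SPEED_TOK_S = 48.0          # output speed, from the listing
MEDIAN_SPEED_TOK_S = 86.0   # market median speed
TTFT_S = 37.52              # time to first token
MEDIAN_TTFT_S = 1.07        # market median TTFT
BLENDED_PRICE = 0.90        # USD per 1M tokens

# Assumed definition: tokens/sec per dollar of blended price.
throughput_per_dollar = SPEED_TOK_S / BLENDED_PRICE

# Share of the median speed that the model falls short by.
speed_deficit_pct = (1.0 - SPEED_TOK_S / MEDIAN_SPEED_TOK_S) * 100.0

# Extra latency relative to the median TTFT.
ttft_excess_pct = (TTFT_S / MEDIAN_TTFT_S - 1.0) * 100.0

print(f"Throughput/Dollar: {throughput_per_dollar:.0f} tok/s per $/1M")
print(f"Speed vs. median: {speed_deficit_pct:.0f}% slower")
print(f"TTFT vs. median: {ttft_excess_pct:.0f}% slower")
```

The TTFT gap computed this way comes out slightly above the listing's 3400%, which suggests the page rounds large percentages more coarsely.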

Speed Comparison

Kimi K2.5 (Non-reasoning)

48 tok/s (+1%)

Mistral: Mistral Medium 3

47 tok/s (-1%)

Llama 3.2 Instruct 90B (Vision)

48 tok/s (+1%)

Context Window

66K

tokens

Larger than 17% of models

Max Output

16K

tokens

25% of context
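
The "25% of context" figure works out exactly if the rounded sizes are read as powers of two. The sketch below assumes "66K" means 65,536 tokens and "16K" means 16,384 tokens, and that output tokens count against the same window. None of these is confirmed by the listing.

```python
CONTEXT_WINDOW_TOKENS = 65_536  # assumption: "66K" read as 2**16
MAX_OUTPUT_TOKENS = 16_384      # assumption: "16K" read as 2**14


def output_share_pct(max_output: int, context: int) -> float:
    """Maximum output length as a percentage of the context window."""
    return max_output / context * 100.0


def prompt_budget(context: int, max_output: int) -> int:
    """Tokens left for the prompt if the full output allowance is reserved.

    Assumes prompt and output share one window, which the listing does not state.
    """
    return context - max_output


print(f"Output share: {output_share_pct(MAX_OUTPUT_TOKENS, CONTEXT_WINDOW_TOKENS):.0f}%")
print(f"Prompt budget: {prompt_budget(CONTEXT_WINDOW_TOKENS, MAX_OUTPUT_TOKENS):,} tokens")
```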

Benchmarks

MMLU-Pro

75.1%

GPQA Diamond

57.3%

HLE

3.6%

LiveCodeBench

35.2%

SciCode

18.8%

TerminalBench Hard

6.8%

MATH-500

Not evaluated

AIME

Not evaluated

AIME 2025

15.3%

IFBench

28.6%

Long Context Recall

0.0%

Tau2

19.6%


Open Source

Quick Compare

Similar Models

Google: Gemini 2.5 Flash Lite

Google

Q: 12.7 · $0.17/1M · 1.0M ctx

Faster: 443% · Cheaper: 81%

Mistral Small 3

Mistral

Q: 12.7 · $0.10/1M

Faster: 227% · Cheaper: 88%

Nova Lite

Amazon

Q: 12.7 · $0.10/1M

Faster: 269% · Cheaper: 88%

OpenAI: GPT-4o-mini

OpenAI

Q: 12.6 · $0.26/1M · 128K ctx

Faster: 26% · Cheaper: 71%

Hermes 4 - Llama-3.1 70B (Non-reasoning)

Nous Research

Q: 12.6 · $0.20/1M

Faster: 72% · Cheaper: 78%

OpenAI: GPT-4

OpenAI

Q: 12.8 · $37.50/1M · 8K ctx

Slower: 48% · Pricier: 4067%
