Related Models
Pricing
Input
$1.25
per 1M tokens
Output
$2.50
per 1M tokens
Blended
$1.56
per 1M tokens
Cheaper than 27% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$1.56
Monthly
$46.89
vs. Similar Models
Gemma 4 31B (Non-reasoning)Q:0.0
$0.20-87%
DeepSeek: DeepSeek V3.2Q:-0.1
$0.26-84%
MiMo-V2-Flash (Non-reasoning)Q:-0.1
$0.15-90%
Qwen: Qwen3.5-9BQ:+0.2
$0.11-93%
Performance
119
tokens/sec
Faster than 62% of models
0.56
seconds
Faster than 81% of models
0.56
seconds
Faster than 86% of models
Market Median
94 tok/s
26% faster
Median TTFT
1.11s
49% faster
Throughput/Dollar
76
tok/s per $/1M
Speed Comparison
Z.ai: GLM 4.7
119 tok/s-0%
GLM-4.7 (Non-reasoning)
119 tok/s-0%
Qwen: Qwen3 VL 30B A3B Instruct
120 tok/s+0%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
65.8%
HLE
6.5%
LiveCodeBenchNot evaluated
SciCode
37.4%
TerminalBench Hard
18.9%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
47.6%
Long Context Recall
24.7%
Tau2
65.8%
Market AverageTop Score