Related Models
Pricing
Input
$2.00
per 1M tokens
Output
$6.00
per 1M tokens
Blended
$3.00
per 1M tokens
Cheaper than 20% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$3.00
Monthly
$90.00
vs. Similar Models
GLM-4.5V (Reasoning)Q:0.0
$0.90-70%
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)Q:0.0
$0.90-70%
Mistral Small 3.2Q:0.0
$0.13-96%
Qwen3 30B A3B 2507 InstructQ:0.0
$0.21-93%
Performance
53
tokens/sec
Faster than 22% of models
0.78
seconds
Faster than 67% of models
0.78
seconds
Faster than 76% of models
Market Median
94 tok/s
44% slower
Median TTFT
1.10s
30% faster
Throughput/Dollar
18
tok/s per $/1M
Speed Comparison
Qwen: Qwen3.6 Plus
52 tok/s-0%
Z.ai: GLM 5
53 tok/s+0%
MiniMax M2.7
52 tok/s-0%
Benchmarks
MMLU-Pro
69.7%
GPQA Diamond
48.6%
HLE
4.0%
LiveCodeBench
29.3%
SciCode
29.2%
TerminalBench Hard
6.1%
MATH-500
73.6%
AIME
11.0%
AIME 2025
14.0%
IFBench
31.2%
Long Context Recall
5.3%
Tau2
30.7%
Market AverageTop Score