Related Models
Pricing
Input
$0.12
per 1M tokens
Output
$0.43
per 1M tokens
Blended
$0.20
per 1M tokens
Cheaper than 69% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.20
Monthly
$6.00
vs. Similar Models
MiniMax: MiniMax M2.5Q:+0.1
$0.21+5%
Qwen: Qwen3.5 397B A17BQ:+0.1
$0.90+351%
Claude 4.1 Opus (Reasoning)Q:+0.1
$30.00+14900%
GPT-5 (medium)Q:+0.1
$3.44+1619%
Performance
118
tokens/sec
Faster than 61% of models
1.96
seconds
Faster than 23% of models
18.96
seconds
Faster than 27% of models
Market Median
95 tok/s
24% faster
Median TTFT
1.11s
77% slower
Throughput/Dollar
588
tok/s per $/1M
Speed Comparison
GLM-4.7 (Non-reasoning)
118 tok/s+0%
Mistral: Mistral Medium 3.5
117 tok/s-1%
LongCat Flash Lite
117 tok/s-1%
Benchmarks
MMLU-ProNot evaluated
GPQA Diamond
86.7%
HLE
25.5%
LiveCodeBenchNot evaluated
SciCode
41.2%
TerminalBench Hard
34.1%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
63.1%
Long Context Recall
54.7%
Tau2
92.7%
Market AverageTop Score