Performance
Throughput: 16 tokens/sec (faster than 0% of models)
1.36 seconds (faster than 39% of models)
1.36 seconds (faster than 60% of models)
Market median throughput: 94 tok/s (this model is 82% slower)
Median TTFT: 1.10s (this model is 23% slower)
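The "slower than median" figures above can be approximated from the displayed numbers. The sketch below is one plausible reading, not the page's documented method: it assumes throughput gaps are measured as a shortfall relative to the median, and latency gaps as an excess relative to the median. The function names are hypothetical. Because the displayed values are rounded, the results may differ from the listed percentages by about a point.

```python
def percent_slower_throughput(value: float, median: float) -> float:
    """Shortfall below the median throughput, as a percentage of the median."""
    return (median - value) / median * 100


def percent_slower_latency(value: float, median: float) -> float:
    """Excess over the median latency, as a percentage of the median."""
    return (value - median) / median * 100


# Displayed (rounded) values from the Performance section.
throughput_gap = percent_slower_throughput(16, 94)  # tok/s vs. market median
ttft_gap = percent_slower_latency(1.36, 1.10)       # seconds vs. median TTFT

print(f"Throughput gap: {throughput_gap:.1f}%")
print(f"TTFT gap: {ttft_gap:.1f}%")
```

Both results land within a point of the listed 82% and 23%, which is consistent with the page computing them from unrounded measurements.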
Speed Comparison
ERNIE 4.5 300B A47B: 24 tok/s (+44%)
Qwen3.5 4B: 25 tok/s (+50%)
MoonshotAI: Kimi K2 0905: 26 tok/s (+56%)
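As a consistency check on the comparison figures, the sketch below back-solves the baseline throughput that each percentage implies. It assumes, without confirmation from the page, that each percentage means "this much faster than the model being profiled". The helper name `implied_baseline` is hypothetical.

```python
# (throughput in tok/s, listed percentage) for each compared model.
comparisons = {
    "ERNIE 4.5 300B A47B": (24, 44),
    "Qwen3.5 4B": (25, 50),
    "MoonshotAI: Kimi K2 0905": (26, 56),
}


def implied_baseline(other_tok_s: float, pct_faster: float) -> float:
    """Baseline throughput implied by 'other is pct_faster% faster than baseline'."""
    return other_tok_s / (1 + pct_faster / 100)


baselines = {
    name: implied_baseline(tok_s, pct)
    for name, (tok_s, pct) in comparisons.items()
}

for name, baseline in baselines.items():
    print(f"{name}: implied baseline {baseline:.2f} tok/s")
```

Under that assumption, all three rows imply roughly the same baseline, slightly above the displayed 16 tokens/sec, which suggests the headline figure is rounded or truncated for display.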
Benchmarks
MMLU-Pro: 48.5%
GPQA Diamond: 31.5%
HLE: 4.4%
LiveCodeBench: 13.1%
SciCode: 11.0%
TerminalBench Hard: Not evaluated
MATH-500: 69.3%
AIME: 9.3%
AIME 2025: Not evaluated
IFBench: Not evaluated
Long Context Recall: Not evaluated
Tau2: Not evaluated