Related Models
Performance
69
tokens/sec
Faster than 34% of models
0.20
seconds
Faster than 99% of models
29.24
seconds
Faster than 16% of models
Market Median
94 tok/s
26% slower
Median TTFT
1.10s
82% faster
Speed Comparison
GPT-5.5 (low)
69 tok/s-0%
Qwen: Qwen3 VL 32B Instruct
69 tok/s+1%
Grok Build 0.1 0616
68 tok/s-1%
Benchmarks
MMLU-Pro
79.0%
GPQA Diamond
73.3%
HLE
9.8%
LiveCodeBench
80.7%
SciCode
37.3%
TerminalBench Hard
14.4%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
88.0%
IFBench
69.1%
Long Context Recall
50.3%
Tau2
69.3%
Market AverageTop Score