Skip to main content
Back to Explore

Grok 4 Fast (Reasoning)

xAI·Released 2025-09-19
Multimodal

Pricing

Input

$0.20

per 1M tokens

Output

$0.50

per 1M tokens

Blended

$0.28

per 1M tokens

Cheaper than 63% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.28

Monthly

$8.25

vs. Similar Models

Gemini 3 Flash Preview (Non-reasoning)Q:0.0
$1.13+309%
Claude 3.7 Sonnet (Reasoning)Q:-0.3
$6.00+2082%
GPT-5.4 (Non-reasoning)Q:+0.3
$5.91+2048%
MiMo-V2.5-Pro (Non-reasoning)Q:+0.5
$1.35+391%

Performance

90

tokens/sec

Faster than 48% of models

5.59

seconds

Faster than 16% of models

5.59

seconds

Faster than 46% of models

Market Median

94 tok/s

5% slower

Median TTFT

1.11s

402% slower

Throughput/Dollar

327

tok/s per $/1M

Speed Comparison

MiMo-V2-Flash (Feb 2026)
90 tok/s-0%
Hermes 4 - Llama-3.1 70B (Reasoning)
90 tok/s+1%
Qwen3.5 27B (Non-reasoning)
89 tok/s-1%

Benchmarks

MMLU-Pro
85.0%
GPQA Diamond
84.7%
HLE
17.0%
LiveCodeBench
83.2%
SciCode
44.2%
TerminalBench Hard
18.9%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
89.7%
IFBench
50.5%
Long Context Recall
64.7%
Tau2
65.8%
Market AverageTop Score

Quick Compare

Similar Models

Compare all 7 models