Skip to main content
Back to Explore

Grok 4.1 Fast (Non-reasoning)

xAI·Released 2025-11-19
2.0M ctxMultimodal

About

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using...

Pricing

Input

$0.20

per 1M tokens

Output

$0.50

per 1M tokens

Blended

$0.28

per 1M tokens

Cheaper than 63% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.28

Monthly

$8.25

vs. Similar Models

GLM-4.6V (Reasoning)Q:-0.1
$0.45+64%
o1-previewQ:+0.1
$28.88+10400%
GPT-5.4 mini (Non-Reasoning)Q:-0.3
$1.69+514%
Nova 2.0 Omni (low)Q:-0.3
$0.85+209%

Performance

83

tokens/sec

Faster than 42% of models

0.42

seconds

Faster than 94% of models

0.42

seconds

Faster than 96% of models

Market Median

94 tok/s

11% slower

Median TTFT

1.10s

62% faster

Throughput/Dollar

302

tok/s per $/1M

Speed Comparison

Qwen3.5 27B
83 tok/s+0%
OpenAI: GPT-4.1 Mini
83 tok/s+1%
Ministral 3 14B
84 tok/s+1%

Context Window

2.0M

tokens

Larger than 98% of models

Max Output

30K

tokens

2% of context

Benchmarks

MMLU-Pro
74.3%
GPQA Diamond
63.7%
HLE
5.0%
LiveCodeBench
39.9%
SciCode
29.6%
TerminalBench Hard
14.4%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
34.3%
IFBench
36.5%
Long Context Recall
22.0%
Tau2
63.7%
Market AverageTop Score

Quick Compare

Similar Models

Compare all 7 models

Used by Agents