Grok 4.1 Fast Reasoning

Grok 4.1 Fast Reasoning is xAI's reasoning-enabled Grok 4.1 Fast model optimized for agentic operations. It combines structured chain-of-thought reasoning with speed-optimized inference and a context window of 1M tokens for complex agent workflows.

Capabilities: Reasoning, File Input, Vision (Image), Tool Use, Implicit Caching
index.ts
import { streamText } from 'ai'

const result = streamText({
  model: 'xai/grok-4.1-fast-reasoning',
  prompt: 'Why is the sky blue?'
})
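The snippet above starts a stream but does not read it. As a minimal sketch, the helper below drains any async-iterable stream of text chunks into one string. `collectText` and its `onChunk` callback are hypothetical names introduced for illustration; they are not part of the `ai` package. The sketch relies on `result.textStream` being async-iterable.

```typescript
// Hypothetical helper (not part of the 'ai' package): drains an
// async-iterable stream of text chunks and returns the full text.
async function collectText(
  stream: AsyncIterable<string>,
  onChunk?: (chunk: string) => void
): Promise<string> {
  let text = ''
  for await (const chunk of stream) {
    // Forward each chunk as it arrives, e.g. to print incrementally.
    onChunk?.(chunk)
    text += chunk
  }
  return text
}
```

With the `result` from index.ts, usage would look like `const answer = await collectText(result.textStream, (c) => process.stdout.write(c))`.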

More models by xAI

| Model | Context | Latency | Throughput | Input | Output | Cache Read | Cache Write | Web Search | Per Query | Capabilities | Providers | ZDR | No Training | Release Date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | 256K | 0.3s | 197tps | $1.00/M | $2.00/M | $0.2/M | | $5/K + input costs | | | xAI | | | 05/20/2026 |
| | 1M | 1.0s | 80tps | $1.25/M | $2.50/M | $0.2/M | | $5/K + input costs | | | xAI | | | 04/30/2026 |
| | 2M | 0.5s | 127tps | $1.25/M | $2.50/M | $0.2/M | | $5/K + input costs | | | xAI | | | 03/11/2026 |
| | 2M | 0.6s | 50tps | $1.25/M | $2.50/M | $0.2/M | | $5/K + input costs | | | Vertex, xAI | | | 03/09/2026 |
| | 2M | 3.2s | 809tps | $1.25/M | $2.50/M | $0.2/M | | $5/K + input costs | | | xAI | | | 03/09/2026 |
| | 1M | 0.3s | 87tps | $0.20/M | $0.50/M | $0.05/M | | | | | Vertex | | | 07/09/2025 |
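The per-million-token prices above can be turned into a rough per-request estimate. The sketch below uses the rates from the last listed entry ($0.20/M input, $0.50/M output, $0.05/M cache read). The `Pricing` and `Usage` types and the `estimateCostUSD` function are hypothetical names for illustration. The sketch also assumes that cached input tokens are included in the input token count and billed at the cache-read rate instead of the input rate, and that reasoning tokens are already counted as output tokens. Actual billing rules may differ.

```typescript
// Hypothetical types for illustration; prices are USD per 1M tokens.
interface Pricing {
  inputPerM: number
  outputPerM: number
  cacheReadPerM?: number
}

interface Usage {
  inputTokens: number // total input tokens, including cached ones (assumption)
  outputTokens: number // assumed to include any reasoning tokens
  cachedInputTokens?: number
}

// Assumption: cached input tokens are billed at the cache-read rate,
// the remaining input tokens at the regular input rate.
function estimateCostUSD(usage: Usage, pricing: Pricing): number {
  const cached = usage.cachedInputTokens ?? 0
  const uncached = usage.inputTokens - cached
  const cacheRate = pricing.cacheReadPerM ?? pricing.inputPerM
  const total =
    uncached * pricing.inputPerM +
    cached * cacheRate +
    usage.outputTokens * pricing.outputPerM
  return total / 1_000_000
}

// Rates taken from the last entry in the listing above.
const listedRates: Pricing = {
  inputPerM: 0.2,
  outputPerM: 0.5,
  cacheReadPerM: 0.05
}
```

Under these assumptions, 1,000,000 input tokens plus 500,000 output tokens with no cache hits comes to about $0.45. Web search charges ($5/K plus input costs where listed) are not modeled here.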