Grok 4.1 Fast is xAI's most cost-effective model. It excels at tool calling for lightweight tasks, powers latency-sensitive applications, and shines in search-related tasks.
Reasoning
View model card in Model Garden
Model ID
grok-4.1-fast-reasoning
Launch stage
Preview
Supported inputs & outputs
- Inputs:
Text , Image - Outputs:
Text
Capabilities
- Supported
- Function calling Preview feature
- Structured output Preview feature
- Reasoning Preview feature
- Not supported
- Batch predictions Preview feature
Usage types
- Supported
- Fixed quota Preview feature
- Not supported
- Dynamic shared quota Preview feature
- Provisioned Throughput Preview feature
Versions
-
grok-4.1-fast-reasoning - Launch stage: Preview
- Release date: April 7, 2026
Supported regions
Model availability
- Global
-
global endpoint
Quota limits
global endpoint:
- QPM: 160
- Input TPM: 880,000
- Output TPM: 40,000
- Context length: 128,000
Pricing
Non-Reasoning
View model card in Model Garden
Model ID
grok-4.1-fast-non-reasoning
Launch stage
Preview
Supported inputs & outputs
- Inputs:
Text , Image - Outputs:
Text
Capabilities
- Supported
- Function calling Preview feature
- Structured output Preview feature
- Not supported
- Batch predictions Preview feature
- Reasoning Preview feature
Usage types
- Supported
- Fixed quota Preview feature
- Not supported
- Dynamic shared quota Preview feature
- Provisioned Throughput Preview feature
Versions
-
grok-4.1-fast-non-reasoning - Launch stage: Preview
- Release date: April 7, 2026
Supported regions
Model availability
- Global
-
global endpoint
Quota limits
global endpoint:
- QPM: 160
- Input TPM: 880,000
- Output TPM: 40,000
- Context length: 128,000
Pricing

