AI Models/Models/Grok 4.1 Fast Reasoning
Back to model explorerlanguage model

Grok 4.1 Fast Reasoning

xai/grok-4.1-fast-reasoning

Model overview, pricing signals, context window, capabilities, and latest tracked AI Gateway endpoint data.

Provider

xai

Context window

1M

Max output

1M

Input price

$0.20 / 1M

Model overview
AI Gateway model metadata from the latest Supabase snapshot.

Creator

xai

Provider

xai

Type

language

Output price

$0.50 / 1M

Tags and capabilities
Used for filtering and future route selection logic.
reasoningfile-inputvisiontool-useimplicit-cachingfile inputtool use

Latest route highlights

Best routes in the latest Supabase AI Gateway snapshot. Recommendations are scoped to tracked endpoints only.

Fastest route

1.36s

vertex | xai/grok-4.1-fast-reasoning

Lowest p50 latency in the latest AI Gateway snapshot.

Most reliable

100%

vertex | xai/grok-4.1-fast-reasoning

Highest recent uptime in the latest AI Gateway snapshot.

Cheapest route

$0.70 / 1M

vertex | xai/grok-4.1-fast-reasoning

Lowest combined input plus output price in the latest AI Gateway snapshot.

Balanced route

vertex

vertex | xai/grok-4.1-fast-reasoning

Best weighted score in the latest AI Gateway snapshot.

Score 100/100. Score = 45% latency rank + 35% uptime rank + 20% price rank across tracked endpoints.

Endpoint comparison
Latest Supabase endpoint snapshot from 5/17/2026, 7:37:40 AM.
Provider routes

1

Latency p50 range

1.36s - 1.36s

Best uptime

100%

Snapshot

May 17, 7:37 AM

ProviderStatusLatency p50Latency p95UptimeThroughputInput / 1MOutput / 1MContextMax outputSupported parameters
vertexvertex | xai/grok-4.1-fast-reasoning
01.36s2.05s100%259 tok/s$0.20 / 1M$0.50 / 1M1M1M
max_tokenstemperaturestoptoolstool_choicereasoninginclude_reasoning

Historical trends

Compact trend data generated from Supabase AI Gateway endpoint snapshots. Trends become meaningful after multiple snapshot runs.

Latency p50
Lower is better.
vertex: 2.96s at May 16, 9:58 AMvertex: 1.36s at May 17, 7:24 AMvertex: 1.36s at May 17, 7:37 AM
May 16, 9:58 AMMay 17, 7:37 AM
vertex
1.36s

Updated May 17, 7:37 AM

Uptime
Latest availability samples.
vertex: 100% at May 16, 9:58 AMvertex: 100% at May 17, 7:24 AMvertex: 100% at May 17, 7:37 AM
May 16, 9:58 AMMay 17, 7:37 AM
vertex
100%

Updated May 17, 7:37 AM

Throughput p50
Output token speed by provider route.
vertex: 245.5 tok/s at May 16, 9:58 AMvertex: 259 tok/s at May 17, 7:24 AMvertex: 259 tok/s at May 17, 7:37 AM
May 16, 9:58 AMMay 17, 7:37 AM
vertex
259 tok/s

Updated May 17, 7:37 AM

Price history
Input plus output price per 1M tokens.
vertex: $0.70 at May 16, 9:58 AMvertex: $0.70 at May 17, 7:24 AMvertex: $0.70 at May 17, 7:37 AM
May 16, 9:58 AMMay 17, 7:37 AM
vertex
$0.70

Updated May 17, 7:37 AM

AI Gateway data status
Supabase snapshots ready

Tracked models

10

23 endpoint rows

Chart models

10

3 snapshot hours

Models updated

May 17, 7:37 AM

Fresh Supabase snapshot

Endpoint updated

May 17, 7:37 AM

Newest snapshot May 17, 7:37 AM

Data note

Model and endpoint data are read from Supabase AI Gateway snapshots generated by the ingestion pipeline. Pricing, routing, and availability may change, so verify with official providers before production use.