AI Models/Models/Grok 4.1 Fast Reasoning

Back to model explorerlanguage model

Grok 4.1 Fast Reasoning

xai/grok-4.1-fast-reasoning

Model overview, pricing signals, context window, capabilities, and latest tracked AI Gateway endpoint data.

Provider

xai

Context window

Max output

Input price

$0.20 / 1M

Model overview

AI Gateway model metadata from the latest Supabase snapshot.

Creator

xai

Provider

xai

Type

language

Output price

$0.50 / 1M

Tags and capabilities

Used for filtering and future route selection logic.

reasoningfile-inputvisiontool-useimplicit-cachingfile inputtool use

Latest route highlights

Best routes in the latest Supabase AI Gateway snapshot. Recommendations are scoped to tracked endpoints only.

Fastest route

1.36s

vertex | xai/grok-4.1-fast-reasoning

Lowest p50 latency in the latest AI Gateway snapshot.

Most reliable

100%

vertex | xai/grok-4.1-fast-reasoning

Highest recent uptime in the latest AI Gateway snapshot.

Cheapest route

$0.70 / 1M

vertex | xai/grok-4.1-fast-reasoning

Lowest combined input plus output price in the latest AI Gateway snapshot.

Balanced route

vertex

vertex | xai/grok-4.1-fast-reasoning

Best weighted score in the latest AI Gateway snapshot.

Score 100/100. Score = 45% latency rank + 35% uptime rank + 20% price rank across tracked endpoints.

Endpoint comparison

Latest Supabase endpoint snapshot from 5/17/2026, 7:37:40 AM.

Provider routes

Latency p50 range

1.36s - 1.36s

Best uptime

100%

Snapshot

May 17, 7:37 AM

Provider	Status	Latency p50	Latency p95	Uptime	Throughput	Input / 1M	Output / 1M	Context	Max output	Supported parameters
vertexvertex \| xai/grok-4.1-fast-reasoning	0	1.36s	2.05s	100%	259 tok/s	$0.20 / 1M	$0.50 / 1M	1M	1M	max_tokenstemperaturestoptoolstool_choicereasoninginclude_reasoning

Historical trends

Compact trend data generated from Supabase AI Gateway endpoint snapshots. Trends become meaningful after multiple snapshot runs.

Latency p50

Lower is better.

May 16, 9:58 AMMay 17, 7:37 AM

vertex

1.36s

Updated May 17, 7:37 AM

Uptime

Latest availability samples.

May 16, 9:58 AMMay 17, 7:37 AM

vertex

100%

Updated May 17, 7:37 AM

Throughput p50

Output token speed by provider route.

May 16, 9:58 AMMay 17, 7:37 AM

vertex

259 tok/s

Updated May 17, 7:37 AM

Price history

Input plus output price per 1M tokens.

May 16, 9:58 AMMay 17, 7:37 AM

vertex

$0.70

Updated May 17, 7:37 AM

AI Gateway data status

Supabase snapshots ready

Tracked models

23 endpoint rows

Chart models

3 snapshot hours

Models updated

May 17, 7:37 AM

Fresh Supabase snapshot

Endpoint updated

May 17, 7:37 AM

Newest snapshot May 17, 7:37 AM

Data note

Model and endpoint data are read from Supabase AI Gateway snapshots generated by the ingestion pipeline. Pricing, routing, and availability may change, so verify with official providers before production use.