google/gemini-3.5-flash
Model overview, pricing signals, context window, capabilities, and latest tracked provider route data.
Open the cost calculator with Gemini 3.5 Flash selected and compare monthly token spend.
Filter the route finder to this model and rank endpoints by latency, uptime, price, or throughput.
Open the model comparison view with this model selected and add alternatives.
Best route choices in the latest indexed snapshot. These are scoped to tracked endpoints for this model, not a global provider ranking.
Fastest route
2.93s
latency
2.93s
uptime
100%
throughput
200 tok/s
price
$10.50 / 1M
Lowest p50 latency in the latest indexed route data.
Sort tracked endpoints by p50 latency ascending.
Most reliable
100%
latency
2.93s
uptime
100%
throughput
200 tok/s
price
$10.50 / 1M
Highest recent uptime in the latest indexed route data.
Sort tracked endpoints by latest available uptime descending.
Cheapest route
$10.50 / 1M
latency
2.93s
uptime
100%
throughput
200 tok/s
price
$10.50 / 1M
Lowest combined input plus output price in the latest indexed route data.
Sort tracked endpoints by input price plus output price ascending.
Balanced route
100/100
latency
2.93s
uptime
100%
throughput
200 tok/s
price
$10.50 / 1M
Best weighted score in the latest indexed route data.
Score = 45% latency rank + 35% uptime rank + 20% price rank across tracked endpoints.
2
2.93s - 3.31s
100%
May 22, 12:05 PM
| Provider | Best for | Status | Latency p50 | Latency p95 | Uptime | Throughput | Input / 1M | Output / 1M | Context | Max output | Supported parameters |
|---|---|---|---|---|---|---|---|---|---|---|---|
googlegoogle | google/gemini-3.5-flash | BalancedFastestReliableCheapest | 0 | 2.93s | 5.54s | 100% | 200 tok/s | $1.50 / 1M | $9.00 / 1M | 1M | 64K | max_tokenstemperaturestoptoolstool_choicereasoninginclude_reasoning |
vertexvertex | google/gemini-3.5-flash | Tracked | 0 | 3.31s | 8.8s | 99.41% | 331 tok/s | $1.50 / 1M | $9.00 / 1M | 1M | 64K | max_tokenstemperaturestoptoolstool_choicereasoninginclude_reasoning |
Compact trend data generated from indexed endpoint history. Trends become meaningful after multiple data updates.
Updated May 22, 12:05 PM
Updated May 22, 12:05 PM
Updated May 22, 12:05 PM
Updated May 22, 12:05 PM
Tracked models
275
Priority comparison set
Provider routes
454
Latest route metrics
Models updated
May 22, 12:05 PM
Fresh model index
Routes updated
May 22, 12:05 PM
Latest route update May 22, 12:05 PM
Related models
Browse nearby models from the same provider to compare context, pricing, and capability coverage.
google/gemini-3.1-flash-lite
Type
language
Context
1M
google/gemma-4-26b-a4b-it
Type
language
Context
262.1K
google/gemma-4-31b-it
Type
language
Context
262.1K
google/gemini-embedding-2
Type
embedding
Context
—
google/gemini-3.1-flash-lite-preview
Type
language
Context
1M
google/gemini-3.1-flash-image-preview
Type
language
Context
131.1K
Model and endpoint data are refreshed regularly from indexed model and route sources. Pricing, routing, and availability may change, so verify with official providers before production use.