language model

Gemini 3.5 Flash

google/gemini-3.5-flash

Model overview, pricing signals, context window, capabilities, and latest tracked provider route data.

Provider

google

Context window

1M

Max output

64K

Input price

$1.50 / 1M

Estimate this model

Open the cost calculator with Gemini 3.5 Flash selected and compare monthly token spend.

Pricing indexed
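
As a rough illustration of what a monthly spend estimate involves, the sketch below multiplies token volumes by the per-1M prices listed on this page ($1.50 input, $9.00 output). The function name `monthly_cost` and the workload figures are hypothetical, and the sketch ignores any discount from implicit caching, so treat it as an approximation rather than the calculator's actual logic.

```python
# Prices per 1M tokens as listed on this page for google/gemini-3.5-flash.
INPUT_PRICE_PER_M = 1.50
OUTPUT_PRICE_PER_M = 9.00


def monthly_cost(input_tokens: int, output_tokens: int) -> float:
    """Return estimated monthly spend in USD for the given token volumes."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return round(input_cost + output_cost, 2)


# Hypothetical workload: 200M input tokens and 40M output tokens per month.
estimate = monthly_cost(200_000_000, 40_000_000)
print(f"${estimate:,.2f}")  # prints "$660.00"
```

Output tokens cost six times as much as input tokens here, so output-heavy workloads dominate the bill even at modest volumes.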

Compare provider routes

Filter the route finder to this model and rank endpoints by latency, uptime, price, or throughput.

2 routes indexed

Compare against alternatives

Open the model comparison view with this model selected and add alternatives.

Side-by-side planning

Model overview

Model metadata from the latest indexed update.

Creator

google

Provider

google

Type

language

Output price

$9.00 / 1M

Tags and capabilities

Used for filtering and future route selection logic.

reasoning, file-input, vision, tool-use, web-search, implicit-caching

Route decision summary

Best route choices in the latest indexed snapshot. These are scoped to tracked endpoints for this model, not a global provider ranking.

Fastest route

2.93s

latest

Provider route

google

google | google/gemini-3.5-flash

latency

2.93s

uptime

100%

throughput

200 tok/s

price

$10.50 / 1M

Lowest p50 latency in the latest indexed route data.

Sort tracked endpoints by p50 latency ascending.

Most reliable

100%

latest

Provider route

google

google | google/gemini-3.5-flash

latency

2.93s

uptime

100%

throughput

200 tok/s

price

$10.50 / 1M

Highest recent uptime in the latest indexed route data.

Sort tracked endpoints by latest available uptime descending.

Cheapest route

$10.50 / 1M

latest

Provider route

google

google | google/gemini-3.5-flash

latency

2.93s

uptime

100%

throughput

200 tok/s

price

$10.50 / 1M

Lowest combined input plus output price in the latest indexed route data.

Sort tracked endpoints by input price plus output price ascending.
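
The three sort rules above (p50 latency ascending, uptime descending, combined price ascending) can be sketched as follows. The `Route` class and its field names are hypothetical, and the values are copied from the endpoint data on this page. Both routes have the same combined price, so the cheapest pick depends on a tie-break the page does not document; this sketch simply keeps the first route listed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    provider: str
    latency_p50_s: float
    uptime_pct: float
    input_per_m: float
    output_per_m: float

    @property
    def combined_price(self) -> float:
        # Combined price = input price + output price per 1M tokens.
        return self.input_per_m + self.output_per_m


ROUTES = [
    Route("google", 2.93, 100.0, 1.50, 9.00),
    Route("vertex", 3.31, 99.41, 1.50, 9.00),
]

# min()/max() return the first matching item on ties (assumed tie-break).
fastest = min(ROUTES, key=lambda r: r.latency_p50_s)
most_reliable = max(ROUTES, key=lambda r: r.uptime_pct)
cheapest = min(ROUTES, key=lambda r: r.combined_price)

print(fastest.provider, most_reliable.provider, cheapest.provider)
```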

Balanced route

100/100

latest

Provider route

google

google | google/gemini-3.5-flash

latency

2.93s

uptime

100%

throughput

200 tok/s

price

$10.50 / 1M

Best weighted score in the latest indexed route data.

Score = 45% latency rank + 35% uptime rank + 20% price rank across tracked endpoints.
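
One way to read the weighted formula above is sketched below. The page does not say how a rank becomes a 0 to 100 value, so this sketch assumes that the best rank earns 100 points, the worst earns 0, points scale linearly in between, and tied routes share the better rank. The names `rank_points` and `balanced_scores` are hypothetical.

```python
WEIGHTS = {"latency": 0.45, "uptime": 0.35, "price": 0.20}
HIGHER_IS_BETTER = {"uptime"}

# Values from the endpoint data on this page; price is input plus output per 1M.
ROUTES = {
    "google": {"latency": 2.93, "uptime": 100.0, "price": 10.50},
    "vertex": {"latency": 3.31, "uptime": 99.41, "price": 10.50},
}


def rank_points(values: dict[str, float], higher_is_better: bool) -> dict[str, float]:
    """Map each route's metric to 0-100 points by rank (assumed linear scale)."""
    n = len(values)
    points = {}
    for name, value in values.items():
        if higher_is_better:
            better = sum(1 for v in values.values() if v > value)
        else:
            better = sum(1 for v in values.values() if v < value)
        # Routes with nothing strictly better get 100; ties share the better rank.
        points[name] = 100.0 if n == 1 else 100.0 * (n - 1 - better) / (n - 1)
    return points


def balanced_scores(routes: dict[str, dict[str, float]]) -> dict[str, float]:
    """Weighted sum of per-metric rank points for every route."""
    totals = {name: 0.0 for name in routes}
    for metric, weight in WEIGHTS.items():
        values = {name: metrics[metric] for name, metrics in routes.items()}
        points = rank_points(values, metric in HIGHER_IS_BETTER)
        for name in routes:
            totals[name] += weight * points[name]
    return {name: round(total, 1) for name, total in totals.items()}


print(balanced_scores(ROUTES))
```

Under these assumptions the google route scores 100.0, which agrees with the 100/100 shown for the balanced route. Other rank-to-points mappings would give the same winner but different scores for the remaining routes.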

Endpoint comparison

Latest endpoint data from 5/22/2026, 12:05:00 PM.

Provider routes

2

Latency p50 range

2.93s - 3.31s

Best uptime

100%

Snapshot

May 22, 12:05 PM

Provider	Best for	Status	Latency p50	Latency p95	Uptime	Throughput	Input / 1M	Output / 1M	Context	Max output	Supported parameters
google | google/gemini-3.5-flash	Balanced, Fastest, Reliable, Cheapest	0	2.93s	5.54s	100%	200 tok/s	$1.50 / 1M	$9.00 / 1M	1M	64K	max_tokens, temperature, stop, tools, tool_choice, reasoning, include_reasoning
vertex | google/gemini-3.5-flash	Tracked	0	3.31s	8.8s	99.41%	331 tok/s	$1.50 / 1M	$9.00 / 1M	1M	64K	max_tokens, temperature, stop, tools, tool_choice, reasoning, include_reasoning
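
The table shows a trade-off: the google route has the lower p50 latency, while the vertex route has the higher throughput. The sketch below estimates total response time as latency plus output tokens divided by throughput. It assumes the latency figure measures time before output starts and that throughput stays constant, neither of which this page confirms. The function names are hypothetical.

```python
# p50 latency (seconds) and throughput (tokens/second) from the table above.
ROUTES = {
    "google": {"latency_p50_s": 2.93, "throughput_tok_s": 200.0},
    "vertex": {"latency_p50_s": 3.31, "throughput_tok_s": 331.0},
}


def estimated_seconds(route: str, output_tokens: int) -> float:
    """Estimated time to finish a response of the given output length."""
    metrics = ROUTES[route]
    return metrics["latency_p50_s"] + output_tokens / metrics["throughput_tok_s"]


def break_even_tokens(a: str, b: str) -> float:
    """Output length at which routes a and b take the same estimated time.

    Assumes the two routes have different throughput values.
    """
    ra, rb = ROUTES[a], ROUTES[b]
    latency_gap = rb["latency_p50_s"] - ra["latency_p50_s"]
    per_token_gap = 1 / ra["throughput_tok_s"] - 1 / rb["throughput_tok_s"]
    return latency_gap / per_token_gap


for tokens in (100, 1000):
    times = {name: round(estimated_seconds(name, tokens), 2) for name in ROUTES}
    print(tokens, times)

print(round(break_even_tokens("google", "vertex")))
```

Under this model the break-even point is roughly 192 output tokens: shorter responses finish sooner on the google route, longer ones on the vertex route.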

Historical trends

Compact trend data generated from indexed endpoint history. Trends become meaningful after multiple data updates.

Latency p50

Lower is better.

May 20, 12:07 PM to May 22, 12:05 PM

google

2.93s

vertex

3.31s

Updated May 22, 12:05 PM

Uptime

Latest availability samples.

May 20, 12:07 PM to May 22, 12:05 PM

google

100%

vertex

99.41%

Updated May 22, 12:05 PM

Throughput p50

Output token speed by provider route.

May 20, 12:07 PM to May 22, 12:05 PM

google

200 tok/s

vertex

331 tok/s

Updated May 22, 12:05 PM

Price history

Input plus output price per 1M tokens.

May 20, 12:07 PM to May 22, 12:05 PM

google

$10.50

vertex

$10.50

Updated May 22, 12:05 PM

AI model data status

Data current

Tracked models

275

Priority comparison set

Provider routes

454

Latest route metrics

Models updated

May 22, 12:05 PM

Fresh model index

Routes updated

May 22, 12:05 PM

Latest route update May 22, 12:05 PM

Related models

More models from google

Browse nearby models from the same provider to compare context, pricing, and capability coverage.

Gemini 3.1 Flash Lite

google/gemini-3.1-flash-lite

Type

language

Context

Gemma 4 26B A4B IT

google/gemma-4-26b-a4b-it

Type

language

Context

262.1K

Gemma 4 31B IT

google/gemma-4-31b-it

Type

language

Context

262.1K

Gemini Embedding 2

google/gemini-embedding-2

Type

embedding

Context

—

Gemini 3.1 Flash Lite Preview

google/gemini-3.1-flash-lite-preview

Type

language

Context

Gemini 3.1 Flash Image Preview (Nano Banana 2)

google/gemini-3.1-flash-image-preview

Type

language

Context

131.1K

Data note

Model and endpoint data are refreshed regularly from indexed model and route sources. Pricing, routing, and availability may change, so verify with official providers before production use.