Llama 3.3 70B Instruct
meta/llama-3.3-70b
Model overview, pricing signals, context window, capabilities, and latest tracked AI Gateway endpoint data.
Provider
meta
Context window
128K
Max output
8.2K
Input price
$0.72 / 1M
Creator
meta
Type
language
Output price
$0.72 / 1M
Latest route highlights
Best routes in the latest Supabase AI Gateway snapshot. Recommendations are scoped to tracked endpoints only.
Fastest route
101ms
groq | meta/llama-3.3-70b
Lowest p50 latency in the latest AI Gateway snapshot.
Most reliable
100%
bedrock | meta/llama-3.3-70b
Highest recent uptime in the latest AI Gateway snapshot.
Cheapest route
$1.38 / 1M
groq | meta/llama-3.3-70b
Lowest combined input plus output price in the latest AI Gateway snapshot.
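The "combined input plus output price" comparison can be checked by hand. The sketch below copies the per-1M-token prices from the endpoint table on this page; the dictionary layout and the `combined_price` helper are illustrative names, not part of any gateway API.

```python
# Prices in USD per 1M tokens, copied from the endpoint table on this page.
endpoints = {
    "groq": {"input": 0.59, "output": 0.79},
    "bedrock": {"input": 0.72, "output": 0.72},
}


def combined_price(prices):
    """Sum input and output price, rounded to cents to avoid float noise."""
    return round(prices["input"] + prices["output"], 2)


# Pick the endpoint with the lowest combined price.
cheapest = min(endpoints, key=lambda name: combined_price(endpoints[name]))
print(cheapest, combined_price(endpoints[cheapest]))
```

With these figures groq comes to $1.38 and bedrock to $1.44 per 1M input plus 1M output tokens, which agrees with the cheapest route listed above.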
Balanced route
groq
groq | meta/llama-3.3-70b
Best weighted score in the latest AI Gateway snapshot.
Score 100/100. Score = 45% latency rank + 35% uptime rank + 20% price rank across tracked endpoints.
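The page gives the weights but not how a rank becomes points, so the following is only a sketch under stated assumptions: the best rank earns 100 points, the worst earns 0, tied endpoints share the better rank, and ranks are spaced linearly in between. The function names and the data layout are hypothetical; the latency, uptime, and combined price values are taken from the endpoint table on this page.

```python
# Weights stated on the page: 45% latency, 35% uptime, 20% price.
WEIGHTS = {"latency": 0.45, "uptime": 0.35, "price": 0.20}

# Metrics where a larger value is better; lower is better for the rest.
HIGHER_IS_BETTER = {"uptime"}

# p50 latency (ms), uptime (%), combined input + output price (USD / 1M).
endpoints = {
    "groq": {"latency": 101, "uptime": 100.0, "price": 1.38},
    "bedrock": {"latency": 157, "uptime": 100.0, "price": 1.44},
}


def rank_points(values, higher_is_better=False):
    """ASSUMPTION: best rank = 100 points, worst = 0, ties share a rank."""
    n = len(values)
    points = {}
    for name, value in values.items():
        if higher_is_better:
            better = sum(1 for other in values.values() if other > value)
        else:
            better = sum(1 for other in values.values() if other < value)
        points[name] = 100.0 if n == 1 else 100.0 * (1 - better / (n - 1))
    return points


def weighted_scores(rows):
    """Combine per-metric rank points using the stated weights."""
    scores = {name: 0.0 for name in rows}
    for metric, weight in WEIGHTS.items():
        values = {name: row[metric] for name, row in rows.items()}
        pts = rank_points(values, higher_is_better=metric in HIGHER_IS_BETTER)
        for name in rows:
            scores[name] += weight * pts[name]
    return {name: round(score, 1) for name, score in scores.items()}


scores = weighted_scores(endpoints)
print(scores)
```

Under these assumptions groq scores 100.0, consistent with the 100/100 shown above, because it ranks first on latency and price and ties for first on uptime. The bedrock value this sketch produces is not a figure from the page and depends entirely on the assumed rank-to-points mapping.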
Tracked endpoints
2
Latency p50 range
101ms - 157ms
Uptime
100%
Last updated
May 17, 7:37 AM
| Provider | Status | Latency p50 | Latency p95 | Uptime | Throughput | Input / 1M | Output / 1M | Context | Max output | Supported parameters |
|---|---|---|---|---|---|---|---|---|---|---|
| groq \| meta/llama-3.3-70b | 0 | 101ms | 194ms | 100% | n/a | $0.59 | $0.79 | 128K | 32.8K | max_tokens, temperature, stop, tools, tool_choice |
| bedrock \| meta/llama-3.3-70b | 0 | 157ms | 551ms | 100% | 189 tok/s | $0.72 | $0.72 | 128K | 8.2K | max_tokens, temperature, stop, tools, tool_choice |
Historical trends
Compact trend data generated from Supabase AI Gateway endpoint snapshots. Trends become meaningful after multiple snapshot runs.
Updated May 17, 7:37 AM
Tracked models
10
23 endpoint rows
Chart models
10
3 snapshot hours
Models updated
May 17, 7:37 AM
Fresh Supabase snapshot
Endpoint updated
May 17, 7:37 AM
Newest snapshot May 17, 7:37 AM
Data note
Model and endpoint data are read from Supabase AI Gateway snapshots generated by the ingestion pipeline. Pricing, routing, and availability may change, so verify with official providers before production use.