AI Tools Hub

Discover, compare, and find the perfect AI tools to accelerate your workflow, boost creativity, and scale your business.

Tool profiles are reviewed against official product sources where available.

ToolsHub helps users discover and compare AI tools using curated summaries, practical use cases, and public product information. Features and pricing can change, so always check the official product website before making a final decision.

© 2026 ToolsHub.

Human-Curated AI Directory

Historical Trend Charts

Track AI route trends over time

Review a focused chart set for tracked models across latency, uptime, throughput, and token price signals.

Tracked models: 273
Featured charts: 24
Samples: 752
Latest update: May 20, 12:07 PM

Featured trend set

Showing 24 of 24 charted models from 273 tracked models.

Qwen3-14B
alibaba/qwen-3-14b
1 provider

Latency: 270ms
Uptime: 100%
Throughput: 39 tok/s
Price: $0.36

Latency p50 (1 line)
deepinfra: 275ms at May 18, 2:07 AM; 299ms at May 18, 12:50 PM; 280ms at May 19, 12:25 PM; 270ms at May 20, 12:07 PM

Qwen3 235B A22b Instruct 2507
alibaba/qwen-3-235b
4 providers

Latency: 104ms
Uptime: 100%
Throughput: 72 tok/s
Price: $0.534

Latency p50 (4 lines)
cerebras: 136ms at May 18, 2:07 AM; 113ms at May 18, 12:50 PM; 109ms at May 19, 12:25 PM; 104ms at May 20, 12:07 PM
deepinfra: 337ms at May 18, 2:07 AM; 336ms at May 18, 12:50 PM; 1.96s at May 19, 12:25 PM; 387ms at May 20, 12:07 PM
novita: 1.28s at May 18, 2:07 AM; 1.2s at May 18, 12:50 PM; 1.24s at May 19, 12:25 PM; 1.21s at May 20, 12:07 PM
vertex: 417ms at May 18, 2:07 AM; 360ms at May 18, 12:50 PM; 382ms at May 19, 12:25 PM; 427ms at May 20, 12:07 PM

Qwen3-30B-A3B
alibaba/qwen-3-30b
1 provider

Latency: 233ms
Uptime: 100%
Throughput: 55 tok/s
Price: $0.37

Latency p50 (1 line)
deepinfra: 218ms at May 18, 2:07 AM; 215ms at May 18, 12:50 PM; 229ms at May 19, 12:25 PM; 233ms at May 20, 12:07 PM

Qwen 3 32B
alibaba/qwen-3-32b
4 providers

Latency: 245ms
Uptime: 100%
Throughput: 285 tok/s
Price: $0.40

Latency p50 (4 lines)
alibaba: 366ms at May 18, 2:07 AM; 378ms at May 18, 12:50 PM; 650ms at May 19, 12:25 PM; 836ms at May 20, 12:07 PM
bedrock: 157ms at May 18, 2:07 AM; 162ms at May 18, 12:50 PM; 241ms at May 19, 12:25 PM; 251ms at May 20, 12:07 PM
deepinfra: 237ms at May 18, 2:07 AM; 247ms at May 18, 12:50 PM; 259ms at May 19, 12:25 PM; 245ms at May 20, 12:07 PM
groq: 168ms at May 18, 2:07 AM; 189ms at May 18, 12:50 PM; 195ms at May 19, 12:25 PM; 275ms at May 20, 12:07 PM

Qwen 3.6 Max Preview
alibaba/qwen-3.6-max-preview
1 provider

Latency: 2.84s
Uptime: 100%
Throughput: 80 tok/s
Price: $9.10

Latency p50 (1 line)
alibaba: 1.73s at May 18, 2:07 AM; 1.82s at May 18, 12:50 PM; 3.78s at May 19, 12:25 PM; 2.84s at May 20, 12:07 PM

Qwen3 VL 235B A22B Thinking
alibaba/qwen3-235b-a22b-thinking
3 providers

Latency: 279ms
Uptime: 100%
Throughput: 88 tok/s
Price: $2.53

Latency p50 (3 lines)
alibaba: 1.12s at May 18, 2:07 AM; 1.05s at May 18, 12:50 PM; 1.62s at May 19, 12:25 PM; 1.22s at May 20, 12:07 PM
deepinfra: 296ms at May 18, 2:07 AM; 282ms at May 18, 12:50 PM; 718ms at May 19, 12:25 PM; 279ms at May 20, 12:07 PM
novita: 1.03s at May 18, 2:07 AM; 1.09s at May 18, 12:50 PM; 1.09s at May 19, 12:25 PM; 1.02s at May 20, 12:07 PM

Qwen3 Coder 480B A35B Instruct
alibaba/qwen3-coder
4 providers

Latency: 398ms
Uptime: 100%
Throughput: 95 tok/s
Price: $2.00

Latency p50 (4 lines)
alibaba: 958ms at May 18, 2:07 AM; 1.15s at May 18, 12:50 PM; 1.26s at May 19, 12:25 PM; 1.39s at May 20, 12:07 PM
deepinfra: 421ms at May 18, 2:07 AM; 492ms at May 18, 12:50 PM; 1.17s at May 19, 12:25 PM; 398ms at May 20, 12:07 PM
novita: 1.18s at May 18, 2:07 AM; 1.2s at May 18, 12:50 PM; 1.25s at May 19, 12:25 PM; 1.23s at May 20, 12:07 PM
vertex: 642ms at May 18, 2:07 AM; 644ms at May 18, 12:50 PM; 657ms at May 19, 12:25 PM; 638ms at May 20, 12:07 PM

Qwen 3 Coder 30B A3B Instruct
alibaba/qwen3-coder-30b-a3b
2 providers

Latency: 161ms
Uptime: 100%
Throughput: 116 tok/s
Price: $0.34

Latency p50 (2 lines)
bedrock: 147ms at May 18, 2:07 AM; 169ms at May 18, 12:50 PM; 168ms at May 19, 12:25 PM; 161ms at May 20, 12:07 PM
novita: 1.18s at May 18, 2:07 AM; 1.13s at May 18, 12:50 PM; 1.18s at May 19, 12:25 PM; 1.2s at May 20, 12:07 PM

Qwen3 Coder Next
alibaba/qwen3-coder-next
2 providers

Latency: 2.35s
Uptime: 68.75%
Throughput: 27.5 tok/s
Price: $1.70

Latency p50 (1 line)
bedrock: 853ms at May 18, 2:07 AM; 157ms at May 18, 12:50 PM; 1.91s at May 19, 12:25 PM; 2.35s at May 20, 12:07 PM

Qwen3 Coder Plus
alibaba/qwen3-coder-plus
1 provider

Latency: 1.05s
Uptime: 100%
Throughput: 51 tok/s
Price: $6.00

Latency p50 (1 line)
alibaba: 909ms at May 18, 2:07 AM; 911ms at May 18, 12:50 PM; 1.29s at May 19, 12:25 PM; 1.05s at May 20, 12:07 PM

Qwen3 Embedding 0.6B
alibaba/qwen3-embedding-0.6b
1 provider

Latency: -
Uptime: 100%
Throughput: -
Price: $0.01

Latency p50 (0 lines)
Need at least two samples.

Qwen3 Embedding 4B
alibaba/qwen3-embedding-4b
1 provider

Latency: -
Uptime: 100%
Throughput: -
Price: $0.02

Latency p50 (0 lines)
Need at least two samples.

Qwen3 Embedding 8B
alibaba/qwen3-embedding-8b
1 provider

Latency: -
Uptime: 100%
Throughput: -
Price: $0.05

Latency p50 (0 lines)
Need at least two samples.

Qwen3 Max
alibaba/qwen3-max
2 providers

Latency: 1.15s
Uptime: 100%
Throughput: 34 tok/s
Price: $4.23

Latency p50 (2 lines)
alibaba: 1.44s at May 18, 2:07 AM; 1.39s at May 18, 12:50 PM; 1.47s at May 19, 12:25 PM; 1.29s at May 20, 12:07 PM
novita: 1.13s at May 18, 2:07 AM; 1.14s at May 18, 12:50 PM; 1.25s at May 19, 12:25 PM; 1.15s at May 20, 12:07 PM

Qwen3 Max Preview
alibaba/qwen3-max-preview
1 provider

Latency: 1.62s
Uptime: 100%
Throughput: 44 tok/s
Price: $7.20

Latency p50 (1 line)
alibaba: 1.6s at May 18, 2:07 AM; 1.69s at May 18, 12:50 PM; 1.62s at May 19, 12:25 PM; 1.62s at May 20, 12:07 PM

Qwen 3 Max Thinking
alibaba/qwen3-max-thinking
1 provider

Latency: 1.26s
Uptime: 100%
Throughput: 34 tok/s
Price: $7.20

Latency p50 (1 line)
alibaba: 1.21s at May 18, 2:07 AM; 1.39s at May 18, 12:50 PM; 1.49s at May 19, 12:25 PM; 1.26s at May 20, 12:07 PM

Qwen3 Next 80B A3B Instruct
alibaba/qwen3-next-80b-a3b-instruct
4 providers

Latency: 315ms
Uptime: 100%
Throughput: 139 tok/s
Price: $1.19

Latency p50 (4 lines)
alibaba: 488ms at May 18, 2:07 AM; 886ms at May 18, 12:50 PM; 928ms at May 19, 12:25 PM; 909ms at May 20, 12:07 PM
deepinfra: 396ms at May 18, 2:07 AM; 376ms at May 18, 12:50 PM; 433ms at May 19, 12:25 PM; 467ms at May 20, 12:07 PM
novita: 887ms at May 18, 2:07 AM; 896ms at May 18, 12:50 PM; 895ms at May 19, 12:25 PM; 979ms at May 20, 12:07 PM
vertex: 283ms at May 18, 2:07 AM; 289ms at May 18, 12:50 PM; 311ms at May 19, 12:25 PM; 315ms at May 20, 12:07 PM

Qwen3 Next 80B A3B Thinking
alibaba/qwen3-next-80b-a3b-thinking
3 providers

Latency: 312ms
Uptime: 100%
Throughput: 436 tok/s
Price: $1.35

Latency p50 (3 lines)
alibaba: 547ms at May 18, 2:07 AM; 947ms at May 18, 12:50 PM; 984ms at May 19, 12:25 PM; 758ms at May 20, 12:07 PM
novita: 1.01s at May 18, 2:07 AM; 988ms at May 18, 12:50 PM; 985ms at May 19, 12:25 PM; 961ms at May 20, 12:07 PM
vertex: 302ms at May 18, 2:07 AM; 304ms at May 18, 12:50 PM; 308ms at May 19, 12:25 PM; 312ms at May 20, 12:07 PM

Qwen3 VL 235B A22B Instruct
alibaba/qwen3-vl-235b-a22b-instruct
2 providers

Latency: 428ms
Uptime: 100%
Throughput: 52 tok/s
Price: $1.08

Latency p50 (2 lines)
alibaba: 805ms at May 18, 2:07 AM; 943ms at May 18, 12:50 PM; 915ms at May 19, 12:25 PM; 873ms at May 20, 12:07 PM
deepinfra: 372ms at May 18, 2:07 AM; 379ms at May 18, 12:50 PM; 868ms at May 19, 12:25 PM; 428ms at May 20, 12:07 PM

Qwen3 VL 235B A22B Instruct
alibaba/qwen3-vl-instruct
3 providers

Latency: 561ms
Uptime: 100%
Throughput: 53.5 tok/s
Price: $1.08

Latency p50 (3 lines)
alibaba: 884ms at May 18, 2:07 AM; 700ms at May 18, 12:50 PM; 938ms at May 19, 12:25 PM; 576ms at May 20, 12:07 PM
deepinfra: 388ms at May 18, 2:07 AM; 396ms at May 18, 12:50 PM; 917ms at May 19, 12:25 PM; 561ms at May 20, 12:07 PM
novita: 668ms at May 18, 2:07 AM; 643ms at May 18, 12:50 PM; 667ms at May 19, 12:25 PM; 831ms at May 20, 12:07 PM

Qwen3 VL 235B A22B Thinking
alibaba/qwen3-vl-thinking
2 providers

Latency: 1.07s
Uptime: 100%
Throughput: 91.5 tok/s
Price: $4.40

Latency p50 (2 lines)
alibaba: 1.28s at May 18, 2:07 AM; 1.31s at May 18, 12:50 PM; 1.98s at May 19, 12:25 PM; 1.81s at May 20, 12:07 PM
novita: 965ms at May 18, 2:07 AM; 1.08s at May 18, 12:50 PM; 1.07s at May 19, 12:25 PM; 1.07s at May 20, 12:07 PM

Qwen 3.5 Flash
alibaba/qwen3.5-flash
1 provider

Latency: 994ms
Uptime: 100%
Throughput: 250 tok/s
Price: $0.50

Latency p50 (1 line)
alibaba: 1.02s at May 18, 2:07 AM; 711ms at May 18, 12:50 PM; 950ms at May 19, 12:25 PM; 994ms at May 20, 12:07 PM

Qwen 3.5 Plus
alibaba/qwen3.5-plus
1 provider

Latency: 1.47s
Uptime: 100%
Throughput: 110 tok/s
Price: $2.80

Latency p50 (1 line)
alibaba: 1.76s at May 18, 2:07 AM; 1.58s at May 18, 12:50 PM; 1.62s at May 19, 12:25 PM; 1.47s at May 20, 12:07 PM

Qwen 3.6 27B
alibaba/qwen3.6-27b
1 provider

Latency: 1.15s
Uptime: 100%
Throughput: 145 tok/s
Price: $4.20

Latency p50 (1 line)
alibaba: 763ms at May 18, 2:07 AM; 1.16s at May 18, 12:50 PM; 1.17s at May 19, 12:25 PM; 1.15s at May 20, 12:07 PM

Related AI tools
Continue the model decision workflow

AI Model Explorer

Browse model pricing, context, providers, and capabilities.

AI Model Comparison

Compare shortlisted models side by side.

LLM Route Finder

Rank provider routes by latency, uptime, throughput, and price.

AI Cost Calculator

Estimate monthly API spend from token usage.
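
As a rough illustration of that estimate, the sketch below multiplies monthly token volume by a per-token price. It assumes the listed price is a blended rate in US dollars per one million tokens, which this page does not state; the function name `estimate_monthly_cost` and the 30-day month are also illustrative assumptions.

```python
def estimate_monthly_cost(tokens_per_day: float,
                          price_per_million_usd: float,
                          days: int = 30) -> float:
    """Estimate monthly spend in USD.

    Assumption: the price is a blended rate per 1,000,000 tokens.
    The page does not state the unit, so verify it before relying on this.
    """
    if tokens_per_day < 0 or price_per_million_usd < 0 or days < 0:
        raise ValueError("inputs must be non-negative")
    monthly_tokens = tokens_per_day * days
    return round(monthly_tokens / 1_000_000 * price_per_million_usd, 2)


# Hypothetical workload: 2 million tokens per day at a listed price of $0.36.
print(estimate_monthly_cost(2_000_000, 0.36))
```

Providers often price input and output tokens separately, so a real estimate would usually split the volume into those two parts.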

FAQ
AI Model Trend Charts FAQ

Answers for route trend data, latency movement, uptime history, and monitoring caveats.

What are AI model trend charts?

AI model trend charts show how route signals such as latency, uptime, throughput, and token price change across collected endpoint snapshots.
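
To work with snapshots like these programmatically, the latency labels first need a common unit. The sketch below is one possible way to do that, assuming labels always look like `387ms` or `1.96s` as they do on this page; the helper name `parse_latency_ms` is hypothetical, and the sample values are the deepinfra snapshots listed for alibaba/qwen-3-235b.

```python
import re


def parse_latency_ms(label: str) -> float:
    """Convert a latency label such as '387ms' or '1.96s' to milliseconds."""
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*(ms|s)\s*", label)
    if match is None:
        raise ValueError(f"unrecognized latency label: {label!r}")
    value, unit = float(match.group(1)), match.group(2)
    return value * 1000.0 if unit == "s" else value


# (timestamp, p50 latency) pairs for the deepinfra route of alibaba/qwen-3-235b.
snapshots = [
    ("May 18, 2:07 AM", "337ms"),
    ("May 18, 12:50 PM", "336ms"),
    ("May 19, 12:25 PM", "1.96s"),
    ("May 20, 12:07 PM", "387ms"),
]

series_ms = [parse_latency_ms(label) for _, label in snapshots]
# Change between the first and the latest snapshot, in milliseconds.
change_ms = series_ms[-1] - series_ms[0]
print(series_ms, change_ms)
```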

How can trend charts help with model routing?

Trend charts help identify whether a provider route is consistently fast and reliable or only looks good in the latest snapshot.
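
A minimal sketch of that comparison follows, using the four p50 snapshots listed on this page for alibaba/qwen-3-235b (converted to milliseconds). The function names `summarize` and `rank` are illustrative, and ranking by worst observed latency is just one possible consistency measure, not the method this site uses.

```python
from statistics import median

# p50 latency per snapshot in milliseconds, oldest first.
samples = {
    "cerebras": [136, 113, 109, 104],
    "deepinfra": [337, 336, 1960, 387],
    "novita": [1280, 1200, 1240, 1210],
    "vertex": [417, 360, 382, 427],
}


def summarize(series):
    """Return the latest, median, and worst latency of one route."""
    return {
        "latest": series[-1],
        "median": median(series),
        "worst": max(series),
    }


def rank(samples, key):
    """Order provider names from best to worst by the chosen summary key."""
    return sorted(samples, key=lambda name: summarize(samples[name])[key])


print("by latest:", rank(samples, "latest"))
print("by worst: ", rank(samples, "worst"))
```

With this data, deepinfra ranks second on the latest snapshot but last on worst observed latency because of its 1.96s sample on May 19, which is the kind of gap a single snapshot hides.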

Why do AI latency trends change over time?

Latency can change because of provider capacity, region, routing infrastructure, model updates, traffic patterns, or measurement conditions.

Should trend data replace live monitoring?

No. Trend data is useful for planning and comparison, but production systems should still use live monitoring, alerts, and fallback routing.
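
Fallback routing can be sketched as trying routes in preference order and moving on when one fails. The example below is a simplified illustration under stated assumptions: `call_with_fallback`, `AllRoutesFailed`, and the stub route functions are hypothetical names, and a production version would also need timeouts, retries, and alerting.

```python
from typing import Callable, Sequence, Tuple


class AllRoutesFailed(RuntimeError):
    """Raised when every configured route has failed."""


def call_with_fallback(
    routes: Sequence[Tuple[str, Callable[[str], str]]],
    prompt: str,
) -> Tuple[str, str]:
    """Try each (name, send) route in order; return (name, response) from the first success."""
    errors = []
    for name, send in routes:
        try:
            return name, send(prompt)
        except Exception as exc:  # a real system would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise AllRoutesFailed("; ".join(errors))


# Stub routes standing in for real provider clients (hypothetical).
def primary_route(prompt: str) -> str:
    raise TimeoutError("simulated timeout")


def backup_route(prompt: str) -> str:
    return f"echo: {prompt}"


used, reply = call_with_fallback(
    [("primary", primary_route), ("backup", backup_route)],
    "hello",
)
print(used, reply)
```

Trend data could inform the order of the `routes` list, while live monitoring decides when a route should be skipped.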