AI Model Explorer

Browse AI models for your next AI feature

Search indexed AI Gateway models by provider, modality, context window, pricing, and capabilities.

Total models

271

Language models

186

Providers indexed

28

Last updated

May 17, 2026

AI Gateway data status
Supabase snapshots ready

Tracked models

10

23 endpoint rows

Chart models

10

3 snapshot hours

Models updated

May 17, 7:37 AM

Fresh Supabase snapshot

Endpoint updated

May 17, 7:37 AM

Newest snapshot May 17, 7:37 AM

Model directory
Showing 271 of 271 indexed models. Select a row to inspect model and endpoint details.

Qwen3-14B

alibaba/qwen-3-14b

language

Provider

alibaba

Context

41K

Max output

16.4K

Input

$0.12 / 1M

Output

$0.24 / 1M

reasoningtool-usetool use

Qwen3 235B A22b Instruct 2507

alibaba/qwen-3-235b

language

Provider

alibaba

Context

131K

Max output

40K

Input

$0.60 / 1M

Output

$1.20 / 1M

tool-useimplicit-cachingtool use

Qwen3-30B-A3B

alibaba/qwen-3-30b

language

Provider

alibaba

Context

41K

Max output

16.4K

Input

$0.08 / 1M

Output

$0.29 / 1M

reasoningtool-usetool use

Qwen 3 32B

alibaba/qwen-3-32b

language

Provider

alibaba

Context

128K

Max output

8.2K

Input

$0.16 / 1M

Output

$0.64 / 1M

reasoningtool-usetool use

Qwen 3.6 Max Preview

alibaba/qwen-3.6-max-preview

language

Provider

alibaba

Context

240K

Max output

64K

Input

$1.30 / 1M

Output

$7.80 / 1M

reasoningtool-useimplicit-cachingfile-inputvision

Qwen3 VL 235B A22B Thinking

alibaba/qwen3-235b-a22b-thinking

language

Provider

alibaba

Context

131.1K

Max output

32.8K

Input

$0.40 / 1M

Output

$4.00 / 1M

visionreasoningtool-usefile-inputfile input

Qwen3 Coder 480B A35B Instruct

alibaba/qwen3-coder

language

Provider

alibaba

Context

262.1K

Max output

65.5K

Input

$1.50 / 1M

Output

$7.50 / 1M

tool-usetool use

Qwen 3 Coder 30B A3B Instruct

alibaba/qwen3-coder-30b-a3b

language

Provider

alibaba

Context

262.1K

Max output

8.2K

Input

$0.15 / 1M

Output

$0.60 / 1M

reasoningtool-usetool use

Qwen3 Coder Next

alibaba/qwen3-coder-next

language

Provider

alibaba

Context

256K

Max output

256K

Input

$0.50 / 1M

Output

$1.20 / 1M

tool-usetool use

Qwen3 Coder Plus

alibaba/qwen3-coder-plus

language

Provider

alibaba

Context

1M

Max output

65.5K

Input

$1.00 / 1M

Output

$5.00 / 1M

tool-usetool use

Qwen3 Embedding 0.6B

alibaba/qwen3-embedding-0.6b

embedding

Provider

alibaba

Context

32.8K

Max output

32.8K

Input

$0.01 / 1M

Output

Qwen3 Embedding 4B

alibaba/qwen3-embedding-4b

embedding

Provider

alibaba

Context

32.8K

Max output

32.8K

Input

$0.02 / 1M

Output

Qwen3 Embedding 8B

alibaba/qwen3-embedding-8b

embedding

Provider

alibaba

Context

32.8K

Max output

32.8K

Input

$0.05 / 1M

Output

Qwen3 Max

alibaba/qwen3-max

language

Provider

alibaba

Context

262.1K

Max output

32.8K

Input

$1.20 / 1M

Output

$6.00 / 1M

tool-useimplicit-cachingtool use

Qwen3 Max Preview

alibaba/qwen3-max-preview

language

Provider

alibaba

Context

262.1K

Max output

32.8K

Input

$1.20 / 1M

Output

$6.00 / 1M

tool-useimplicit-cachingtool use

Qwen 3 Max Thinking

alibaba/qwen3-max-thinking

language

Provider

alibaba

Context

256K

Max output

65.5K

Input

$1.20 / 1M

Output

$6.00 / 1M

reasoningtool-useimplicit-cachingtool use

Qwen3 Next 80B A3B Instruct

alibaba/qwen3-next-80b-a3b-instruct

language

Provider

alibaba

Context

131.1K

Max output

32.8K

Input

$0.15 / 1M

Output

$1.20 / 1M

Qwen3 Next 80B A3B Thinking

alibaba/qwen3-next-80b-a3b-thinking

language

Provider

alibaba

Context

131.1K

Max output

32.8K

Input

$0.15 / 1M

Output

$1.20 / 1M

Qwen3 VL 235B A22B Instruct

alibaba/qwen3-vl-235b-a22b-instruct

language

Provider

alibaba

Context

131.1K

Max output

129K

Input

$0.40 / 1M

Output

$1.60 / 1M

vision

Qwen3 VL 235B A22B Instruct

alibaba/qwen3-vl-instruct

language

Provider

alibaba

Context

131.1K

Max output

129K

Input

$0.40 / 1M

Output

$1.60 / 1M

vision

Qwen3 VL 235B A22B Thinking

alibaba/qwen3-vl-thinking

language

Provider

alibaba

Context

131.1K

Max output

32.8K

Input

$0.40 / 1M

Output

$4.00 / 1M

visionreasoningtool-usefile-inputfile input

Qwen 3.5 Flash

alibaba/qwen3.5-flash

language

Provider

alibaba

Context

1M

Max output

64K

Input

$0.10 / 1M

Output

$0.40 / 1M

visionexplicit-cachingfile-inputreasoningtool-use

Qwen 3.5 Plus

alibaba/qwen3.5-plus

language

Provider

alibaba

Context

1M

Max output

64K

Input

$0.40 / 1M

Output

$2.40 / 1M

visionexplicit-cachingfile-inputreasoningtool-use

Qwen 3.6 27B

alibaba/qwen3.6-27b

language

Provider

alibaba

Context

256K

Max output

256K

Input

$0.60 / 1M

Output

$3.60 / 1M

reasoningtool-useimplicit-cachingfile-inputvision

Qwen 3.6 Plus

alibaba/qwen3.6-plus

language

Provider

alibaba

Context

1M

Max output

64K

Input

$0.50 / 1M

Output

$3.00 / 1M

reasoningtool-useimplicit-cachingvisionfile-input

Wan v2.5 Text-to-Video Preview

alibaba/wan-v2.5-t2v-preview

video

Provider

alibaba

Context

Max output

Input

Output

Wan v2.6 Image-to-Video

alibaba/wan-v2.6-i2v

video

Provider

alibaba

Context

Max output

Input

Output

Wan v2.6 Image-to-Video Flash

alibaba/wan-v2.6-i2v-flash

video

Provider

alibaba

Context

Max output

Input

Output

Wan v2.6 Reference-to-Video

alibaba/wan-v2.6-r2v

video

Provider

alibaba

Context

Max output

Input

Output

Wan v2.6 Reference-to-Video Flash

alibaba/wan-v2.6-r2v-flash

video

Provider

alibaba

Context

Max output

Input

Output

Wan v2.6 Text-to-Video

alibaba/wan-v2.6-t2v

video

Provider

alibaba

Context

Max output

Input

Output

Nova 2 Lite

amazon/nova-2-lite

language

Provider

amazon

Context

1M

Max output

1M

Input

$0.30 / 1M

Output

$2.50 / 1M

reasoningvision

Nova Lite

amazon/nova-lite

language

Provider

amazon

Context

300K

Max output

8.2K

Input

$0.06 / 1M

Output

$0.24 / 1M

Nova Micro

amazon/nova-micro

language

Provider

amazon

Context

128K

Max output

8.2K

Input

$0.035 / 1M

Output

$0.14 / 1M

Nova Pro

amazon/nova-pro

language

Provider

amazon

Context

300K

Max output

8.2K

Input

$0.80 / 1M

Output

$3.20 / 1M

Titan Text Embeddings V2

amazon/titan-embed-text-v2

embedding

Provider

amazon

Context

Max output

Input

$0.02 / 1M

Output

Claude 3 Haiku

anthropic/claude-3-haiku

language

Provider

anthropic

Context

200K

Max output

4.1K

Input

$0.25 / 1M

Output

$1.25 / 1M

tool-usevisionexplicit-cachingtool use

Claude 3.5 Haiku

anthropic/claude-3.5-haiku

language

Provider

anthropic

Context

200K

Max output

8.2K

Input

$0.80 / 1M

Output

$4.00 / 1M

file-inputtool-usevisionexplicit-cachingfile input

Claude Haiku 4.5

anthropic/claude-haiku-4.5

language

Provider

anthropic

Context

200K

Max output

64K

Input

$1.00 / 1M

Output

$5.00 / 1M

file-inputreasoningtool-usevisionexplicit-caching

Claude Opus 4

anthropic/claude-opus-4

language

Provider

anthropic

Context

200K

Max output

32K

Input

$15.00 / 1M

Output

$75.00 / 1M

file-inputreasoningtool-usevisionexplicit-caching

Claude Opus 4.1

anthropic/claude-opus-4.1

language

Provider

anthropic

Context

200K

Max output

32K

Input

$15.00 / 1M

Output

$75.00 / 1M

file-inputreasoningtool-usevisionexplicit-caching

Claude Opus 4.5

anthropic/claude-opus-4.5

language

Provider

anthropic

Context

200K

Max output

64K

Input

$5.00 / 1M

Output

$25.00 / 1M

tool-usereasoningvisionfile-inputexplicit-caching

Claude Opus 4.6

anthropic/claude-opus-4.6

language

Provider

anthropic

Context

1M

Max output

128K

Input

$5.00 / 1M

Output

$25.00 / 1M

tool-usereasoningvisionfile-inputexplicit-caching

Claude Opus 4.7

anthropic/claude-opus-4.7

language

Provider

anthropic

Context

1M

Max output

128K

Input

$5.00 / 1M

Output

$25.00 / 1M

tool-usereasoningvisionfile-inputexplicit-caching

Claude Sonnet 4

anthropic/claude-sonnet-4

language

Provider

anthropic

Context

1M

Max output

64K

Input

$3.00 / 1M

Output

$15.00 / 1M

file-inputreasoningtool-usevisionexplicit-caching

Claude Sonnet 4.5

anthropic/claude-sonnet-4.5

language

Provider

anthropic

Context

1M

Max output

64K

Input

$3.00 / 1M

Output

$15.00 / 1M

file-inputreasoningtool-usevisionexplicit-caching

Claude Sonnet 4.6

anthropic/claude-sonnet-4.6

language

Provider

anthropic

Context

1M

Max output

128K

Input

$3.00 / 1M

Output

$15.00 / 1M

file-inputreasoningtool-usevisionexplicit-caching

Trinity Large Preview

arcee-ai/trinity-large-preview

language

Provider

arcee-ai

Context

131K

Max output

131K

Input

$0.25 / 1M

Output

$1.00 / 1M

tool-usetool use

Trinity Large Thinking

arcee-ai/trinity-large-thinking

language

Provider

arcee-ai

Context

262.1K

Max output

80K

Input

$0.25 / 1M

Output

$0.90 / 1M

reasoningtool-useimplicit-cachingtool use

Trinity Mini

arcee-ai/trinity-mini

language

Provider

arcee-ai

Context

131.1K

Max output

131.1K

Input

$0.045 / 1M

Output

$0.15 / 1M

FLUX.2 [flex]

bfl/flux-2-flex

image

Provider

bfl

Context

Max output

Input

Output

image-generation

FLUX.2 [klein] 4B

bfl/flux-2-klein-4b

image

Provider

bfl

Context

Max output

Input

Output

image-generation

FLUX.2 [klein] 9B

bfl/flux-2-klein-9b

image

Provider

bfl

Context

Max output

Input

Output

image-generation

FLUX.2 [max]

bfl/flux-2-max

image

Provider

bfl

Context

67.3K

Max output

67.3K

Input

Output

image-generation

FLUX.2 [pro]

bfl/flux-2-pro

image

Provider

bfl

Context

67.3K

Max output

67.3K

Input

Output

image-generation

FLUX.1 Kontext Max

bfl/flux-kontext-max

image

Provider

bfl

Context

512

Max output

Input

Output

image-generation

FLUX.1 Kontext Pro

bfl/flux-kontext-pro

image

Provider

bfl

Context

512

Max output

Input

Output

image-generation

FLUX.1 Fill [pro]

bfl/flux-pro-1.0-fill

image

Provider

bfl

Context

Max output

Input

Output

image-generation

FLUX1.1 [pro]

bfl/flux-pro-1.1

image

Provider

bfl

Context

Max output

Input

Output

image-generation

FLUX1.1 [pro] Ultra

bfl/flux-pro-1.1-ultra

image

Provider

bfl

Context

Max output

Input

Output

image-generation

Seed 1.6

bytedance/seed-1.6

language

Provider

bytedance

Context

256K

Max output

32K

Input

$0.25 / 1M

Output

$2.00 / 1M

reasoningtool-useimplicit-cachingtool use

Bytedance Seed 1.8

bytedance/seed-1.8

language

Provider

bytedance

Context

256K

Max output

64K

Input

$0.25 / 1M

Output

$2.00 / 1M

reasoningvisionimplicit-caching

Seedance 2.0

bytedance/seedance-2.0

video

Provider

bytedance

Context

Max output

Input

Output

vision

Seedance 2.0 Fast

bytedance/seedance-2.0-fast

video

Provider

bytedance

Context

Max output

Input

Output

vision

Seedance v1.0 Lite Image-to-Video

bytedance/seedance-v1.0-lite-i2v

video

Provider

bytedance

Context

Max output

Input

Output

Seedance v1.0 Lite Text-to-Video

bytedance/seedance-v1.0-lite-t2v

video

Provider

bytedance

Context

Max output

Input

Output

Seedance v1.0 Pro

bytedance/seedance-v1.0-pro

video

Provider

bytedance

Context

Max output

Input

Output

Seedance v1.0 Pro Fast

bytedance/seedance-v1.0-pro-fast

video

Provider

bytedance

Context

Max output

Input

Output

Seedance v1.5 Pro

bytedance/seedance-v1.5-pro

video

Provider

bytedance

Context

Max output

Input

Output

Seedream 4.0

bytedance/seedream-4.0

image

Provider

bytedance

Context

Max output

Input

Output

image-generation

Seedream 4.5

bytedance/seedream-4.5

image

Provider

bytedance

Context

Max output

Input

Output

image-generation

Seedream 5.0 Lite

bytedance/seedream-5.0-lite

image

Provider

bytedance

Context

Max output

Input

Output

image-generation

Command A

cohere/command-a

language

Provider

cohere

Context

256K

Max output

8K

Input

$2.50 / 1M

Output

$10.00 / 1M

tool-usetool use

Embed v4.0

cohere/embed-v4.0

embedding

Provider

cohere

Context

Max output

Input

$0.12 / 1M

Output

Cohere Rerank 3.5

cohere/rerank-v3.5

reranking

Provider

cohere

Context

4.1K

Max output

4.1K

Input

Output

Cohere Rerank 4 Fast

cohere/rerank-v4-fast

reranking

Provider

cohere

Context

32K

Max output

32K

Input

Output

Cohere Rerank 4 Pro

cohere/rerank-v4-pro

reranking

Provider

cohere

Context

32K

Max output

32K

Input

Output

DeepSeek-R1

deepseek/deepseek-r1

language

Provider

deepseek

Context

128K

Max output

8.2K

Input

$1.35 / 1M

Output

$5.40 / 1M

reasoningtool-usetool use

DeepSeek V3 0324

deepseek/deepseek-v3

language

Provider

deepseek

Context

163.8K

Max output

16.4K

Input

$0.77 / 1M

Output

$0.77 / 1M

tool-usetool use

DeepSeek-V3.1

deepseek/deepseek-v3.1

language

Provider

deepseek

Context

163.8K

Max output

8.2K

Input

$0.56 / 1M

Output

$1.68 / 1M

reasoningtool-usetool use

DeepSeek V3.1 Terminus

deepseek/deepseek-v3.1-terminus

language

Provider

deepseek

Context

131.1K

Max output

65.5K

Input

$0.27 / 1M

Output

$1.00 / 1M

reasoningtool-usetool use

DeepSeek V3.2

deepseek/deepseek-v3.2

language

Provider

deepseek

Context

128K

Max output

8K

Input

$0.28 / 1M

Output

$0.42 / 1M

tool-useimplicit-cachingtool use

DeepSeek V3.2 Thinking

deepseek/deepseek-v3.2-thinking

language

Provider

deepseek

Context

128K

Max output

8K

Input

$0.62 / 1M

Output

$1.85 / 1M

tool-usetool use

DeepSeek V4 Flash

deepseek/deepseek-v4-flash

language

Provider

deepseek

Context

1M

Max output

384K

Input

$0.14 / 1M

Output

$0.28 / 1M

reasoningtool-useimplicit-cachingtool use

DeepSeek V4 Pro

deepseek/deepseek-v4-pro

language

Provider

deepseek

Context

1M

Max output

384K

Input

$0.435 / 1M

Output

$0.87 / 1M

reasoningtool-useimplicit-cachingtool use

Gemini 2.0 Flash

google/gemini-2.0-flash

language

Provider

google

Context

1M

Max output

8.2K

Input

$0.15 / 1M

Output

$0.60 / 1M

file-inputtool-usevisionweb-searchfile input

Gemini 2.0 Flash Lite

google/gemini-2.0-flash-lite

language

Provider

google

Context

1M

Max output

8.2K

Input

$0.075 / 1M

Output

$0.30 / 1M

file-inputtool-usevisionweb-searchfile input

Gemini 2.5 Flash

google/gemini-2.5-flash

language

Provider

google

Context

1M

Max output

65.5K

Input

$0.30 / 1M

Output

$2.50 / 1M

file-inputreasoningtool-usevisionweb-search

Nano Banana (Gemini 2.5 Flash Image)

google/gemini-2.5-flash-image

language

Provider

google

Context

32.8K

Max output

65.5K

Input

$0.30 / 1M

Output

$2.50 / 1M

image-generationweb-searchweb search

Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

language

Provider

google

Context

1M

Max output

65.5K

Input

$0.10 / 1M

Output

$0.40 / 1M

file-inputreasoningtool-usevisionweb-search

Gemini 2.5 Pro

google/gemini-2.5-pro

language

Provider

google

Context

1M

Max output

65.5K

Input

$1.25 / 1M

Output

$10.00 / 1M

file-inputreasoningtool-usevisionweb-search

Gemini 3 Flash

google/gemini-3-flash

language

Provider

google

Context

1M

Max output

65K

Input

$0.50 / 1M

Output

$3.00 / 1M

reasoningtool-usefile-inputvisionweb-search

Nano Banana Pro (Gemini 3 Pro Image)

google/gemini-3-pro-image

language

Provider

google

Context

65.5K

Max output

32.8K

Input

$2.00 / 1M

Output

$12.00 / 1M

image-generationweb-searchweb search

Gemini 3 Pro Preview

google/gemini-3-pro-preview

language

Provider

google

Context

1M

Max output

64K

Input

$2.00 / 1M

Output

$12.00 / 1M

file-inputtool-usereasoningvisionweb-search

Gemini 3.1 Flash Image Preview (Nano Banana 2)

google/gemini-3.1-flash-image-preview

language

Provider

google

Context

131.1K

Max output

32.8K

Input

$0.50 / 1M

Output

$3.00 / 1M

image-generationweb-searchvisionreasoningweb search

Gemini 3.1 Flash Lite

google/gemini-3.1-flash-lite

language

Provider

google

Context

1M

Max output

65K

Input

$0.25 / 1M

Output

$1.50 / 1M

reasoningtool-useimplicit-cachingfile-inputvision

Gemini 3.1 Flash Lite Preview

google/gemini-3.1-flash-lite-preview

language

Provider

google

Context

1M

Max output

65K

Input

$0.25 / 1M

Output

$1.50 / 1M

reasoningtool-useimplicit-cachingvisionfile-input

Gemini 3.1 Pro Preview

google/gemini-3.1-pro-preview

language

Provider

google

Context

1M

Max output

64K

Input

$2.00 / 1M

Output

$12.00 / 1M

file-inputtool-usereasoningvisionweb-search

Gemini Embedding 001

google/gemini-embedding-001

embedding

Provider

google

Context

Max output

Input

$0.15 / 1M

Output

Gemini Embedding 2

google/gemini-embedding-2

embedding

Provider

google

Context

Max output

Input

$0.20 / 1M

Output

Gemma 4 26B A4B IT

google/gemma-4-26b-a4b-it

language

Provider

google

Context

262.1K

Max output

131.1K

Input

$0.13 / 1M

Output

$0.40 / 1M

visiontool-usefile-inputfile inputtool use

Gemma 4 31B IT

google/gemma-4-31b-it

language

Provider

google

Context

262.1K

Max output

131.1K

Input

$0.14 / 1M

Output

$0.40 / 1M

tool-usevisionfile-inputfile inputtool use

Imagen 4 Fast

google/imagen-4.0-fast-generate-001

image

Provider

google

Context

480

Max output

Input

Output

image-generation

Imagen 4

google/imagen-4.0-generate-001

image

Provider

google

Context

480

Max output

Input

Output

image-generation

Imagen 4 Ultra

google/imagen-4.0-ultra-generate-001

image

Provider

google

Context

480

Max output

Input

Output

image-generation

Text Embedding 005

google/text-embedding-005

embedding

Provider

google

Context

Max output

Input

$0.025 / 1M

Output

Text Multilingual Embedding 002

google/text-multilingual-embedding-002

embedding

Provider

google

Context

Max output

Input

$0.025 / 1M

Output

Veo 3.0 Fast Generate

google/veo-3.0-fast-generate-001

video

Provider

google

Context

Max output

Input

Output

Veo 3.0

google/veo-3.0-generate-001

video

Provider

google

Context

Max output

Input

Output

Veo 3.1 Fast Generate

google/veo-3.1-fast-generate-001

video

Provider

google

Context

Max output

Input

Output

Veo 3.1

google/veo-3.1-generate-001

video

Provider

google

Context

Max output

Input

Output

Mercury 2

inception/mercury-2

language

Provider

inception

Context

128K

Max output

128K

Input

$0.25 / 1M

Output

$0.75 / 1M

tool-usereasoningtool use

Mercury Coder Small Beta

inception/mercury-coder-small

language

Provider

inception

Context

32K

Max output

16.4K

Input

$0.25 / 1M

Output

$1.00 / 1M

tool-usetool use

Interfaze Beta

interfaze/interfaze-beta

language

Provider

interfaze

Context

1M

Max output

32K

Input

$1.50 / 1M

Output

$3.50 / 1M

reasoning

Kling v2.5 Turbo Image-to-Video

klingai/kling-v2.5-turbo-i2v

video

Provider

klingai

Context

Max output

Input

Output

Kling v2.5 Turbo Text-to-Video

klingai/kling-v2.5-turbo-t2v

video

Provider

klingai

Context

Max output

Input

Output

Kling v2.6 Image-to-Video

klingai/kling-v2.6-i2v

video

Provider

klingai

Context

Max output

Input

Output

Kling v2.6 Motion Control

klingai/kling-v2.6-motion-control

video

Provider

klingai

Context

Max output

Input

Output

Kling v2.6 Text-to-Video

klingai/kling-v2.6-t2v

video

Provider

klingai

Context

Max output

Input

Output

Kling v3.0 Image-to-Video

klingai/kling-v3.0-i2v

video

Provider

klingai

Context

Max output

Input

Output

Kling v3.0 Motion Control

klingai/kling-v3.0-motion-control

video

Provider

klingai

Context

Max output

Input

Output

Kling v3.0 Text-to-Video

klingai/kling-v3.0-t2v

video

Provider

klingai

Context

Max output

Input

Output

KAT-Coder-Pro V1

kwaipilot/kat-coder-pro-v1

language

Provider

kwaipilot

Context

256K

Max output

32K

Input

$0.03 / 1M

Output

$1.20 / 1M

reasoning

Kat Coder Pro V2

kwaipilot/kat-coder-pro-v2

language

Provider

kwaipilot

Context

256K

Max output

256K

Input

$0.30 / 1M

Output

$1.20 / 1M

tool-usereasoningimplicit-cachingtool use

LongCat Flash Chat

meituan/longcat-flash-chat

language

Provider

meituan

Context

128K

Max output

100K

Input

Output

tool-usetool use

LongCat Flash Thinking 2601

meituan/longcat-flash-thinking-2601

language

Provider

meituan

Context

32.8K

Max output

32.8K

Input

Output

reasoning

Llama 3.1 70B Instruct

meta/llama-3.1-70b

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.72 / 1M

Output

$0.72 / 1M

tool-usetool use

Llama 3.1 8B Instruct

meta/llama-3.1-8b

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.22 / 1M

Output

$0.22 / 1M

tool-usetool use

Llama 3.2 11B Vision Instruct

meta/llama-3.2-11b

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.16 / 1M

Output

$0.16 / 1M

tool-usevisiontool use

Llama 3.2 1B Instruct

meta/llama-3.2-1b

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.10 / 1M

Output

$0.10 / 1M

Llama 3.2 3B Instruct

meta/llama-3.2-3b

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.15 / 1M

Output

$0.15 / 1M

Llama 3.2 90B Vision Instruct

meta/llama-3.2-90b

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.72 / 1M

Output

$0.72 / 1M

tool-usevisiontool use

Llama 3.3 70B Instruct

meta/llama-3.3-70b

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.72 / 1M

Output

$0.72 / 1M

tool-usetool use

Llama 4 Maverick 17B Instruct

meta/llama-4-maverick

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.24 / 1M

Output

$0.97 / 1M

tool-usevisiontool use

Llama 4 Scout 17B Instruct

meta/llama-4-scout

language

Provider

meta

Context

128K

Max output

8.2K

Input

$0.17 / 1M

Output

$0.66 / 1M

tool-usevisiontool use

MiniMax M2

minimax/minimax-m2

language

Provider

minimax

Context

205K

Max output

205K

Input

$0.30 / 1M

Output

$1.20 / 1M

reasoningtool-useimplicit-cachingtool use

MiniMax M2.1

minimax/minimax-m2.1

language

Provider

minimax

Context

204.8K

Max output

131.1K

Input

$0.30 / 1M

Output

$1.20 / 1M

reasoningtool-useimplicit-cachingtool use

MiniMax M2.1 Lightning

minimax/minimax-m2.1-lightning

language

Provider

minimax

Context

204.8K

Max output

131.1K

Input

$0.30 / 1M

Output

$2.40 / 1M

reasoningtool-useimplicit-cachingtool use

MiniMax M2.5

minimax/minimax-m2.5

language

Provider

minimax

Context

204.8K

Max output

131K

Input

$0.30 / 1M

Output

$1.20 / 1M

reasoningtool-useimplicit-cachingtool use

MiniMax M2.5 High Speed

minimax/minimax-m2.5-highspeed

language

Provider

minimax

Context

204.8K

Max output

131K

Input

$0.60 / 1M

Output

$2.40 / 1M

reasoningtool-useimplicit-cachingtool use

Minimax M2.7

minimax/minimax-m2.7

language

Provider

minimax

Context

204.8K

Max output

131K

Input

$0.30 / 1M

Output

$1.20 / 1M

reasoningtool-useimplicit-cachingfile-inputvision

MiniMax M2.7 High Speed

minimax/minimax-m2.7-highspeed

language

Provider

minimax

Context

204.8K

Max output

131.1K

Input

$0.60 / 1M

Output

$2.40 / 1M

reasoningtool-useimplicit-cachingvisiontool use

Mistral Codestral

mistral/codestral

language

Provider

mistral

Context

128K

Max output

4K

Input

$0.30 / 1M

Output

$0.90 / 1M

tool-usetool use

Codestral Embed

mistral/codestral-embed

embedding

Provider

mistral

Context

Max output

Input

$0.15 / 1M

Output

Devstral 2

mistral/devstral-2

language

Provider

mistral

Context

256K

Max output

256K

Input

$0.40 / 1M

Output

$2.00 / 1M

tool-usetool use

Devstral Small 1.1

mistral/devstral-small

language

Provider

mistral

Context

128K

Max output

64K

Input

$0.10 / 1M

Output

$0.30 / 1M

tool-usetool use

Devstral Small 2

mistral/devstral-small-2

language

Provider

mistral

Context

256K

Max output

256K

Input

$0.10 / 1M

Output

$0.30 / 1M

tool-usetool use

Magistral Medium 2509

mistral/magistral-medium

language

Provider

mistral

Context

128K

Max output

64K

Input

$2.00 / 1M

Output

$5.00 / 1M

reasoningvision

Magistral Small 2509

mistral/magistral-small

language

Provider

mistral

Context

128K

Max output

64K

Input

$0.50 / 1M

Output

$1.50 / 1M

reasoningvision

Ministral 14B

mistral/ministral-14b

language

Provider

mistral

Context

256K

Max output

256K

Input

$0.20 / 1M

Output

$0.20 / 1M

visionfile-inputfile input

Ministral 3B

mistral/ministral-3b

language

Provider

mistral

Context

128K

Max output

4K

Input

$0.10 / 1M

Output

$0.10 / 1M

tool-usetool use

Ministral 8B

mistral/ministral-8b

language

Provider

mistral

Context

128K

Max output

4K

Input

$0.15 / 1M

Output

$0.15 / 1M

tool-usetool use

Mistral Embed

mistral/mistral-embed

embedding

Provider

mistral

Context

Max output

Input

$0.10 / 1M

Output

Mistral Large 3

mistral/mistral-large-3

language

Provider

mistral

Context

256K

Max output

256K

Input

$0.50 / 1M

Output

$1.50 / 1M

vision

Mistral Medium 3.1

mistral/mistral-medium

language

Provider

mistral

Context

128K

Max output

64K

Input

$0.40 / 1M

Output

$2.00 / 1M

tool-usevisiontool use

Mistral Nemo 12B

mistral/mistral-nemo

language

Provider

mistral

Context

131.1K

Max output

131.1K

Input

$0.02 / 1M

Output

$0.04 / 1M

Mistral Small

mistral/mistral-small

language

Provider

mistral

Context

32K

Max output

4K

Input

$0.10 / 1M

Output

$0.30 / 1M

tool-usevisiontool use

Pixtral 12B 2409

mistral/pixtral-12b

language

Provider

mistral

Context

128K

Max output

4K

Input

$0.15 / 1M

Output

$0.15 / 1M

tool-usevisiontool use

Pixtral Large

mistral/pixtral-large

language

Provider

mistral

Context

128K

Max output

4K

Input

$2.00 / 1M

Output

$6.00 / 1M

tool-usevisiontool use

Kimi K2 Instruct

moonshotai/kimi-k2

language

Provider

moonshotai

Context

131.1K

Max output

131.1K

Input

$0.57 / 1M

Output

$2.30 / 1M

tool-usetool use

Kimi K2 Thinking

moonshotai/kimi-k2-thinking

language

Provider

moonshotai

Context

262.1K

Max output

262.1K

Input

$0.60 / 1M

Output

$2.50 / 1M

reasoningtool-useimplicit-cachingtool use

Kimi K2 Thinking Turbo

moonshotai/kimi-k2-thinking-turbo

language

Provider

moonshotai

Context

262.1K

Max output

262.1K

Input

$1.15 / 1M

Output

$8.00 / 1M

reasoningtool-useimplicit-cachingtool use

Kimi K2 Turbo

moonshotai/kimi-k2-turbo

language

Provider

moonshotai

Context

256K

Max output

16.4K

Input

$1.15 / 1M

Output

$8.00 / 1M

tool-usetool use

Kimi K2.5

moonshotai/kimi-k2.5

language

Provider

moonshotai

Context

262.1K

Max output

262.1K

Input

$0.60 / 1M

Output

$3.00 / 1M

reasoningvisiontool-useimplicit-cachingtool use

Kimi K2.6

moonshotai/kimi-k2.6

language

Provider

moonshotai

Context

262K

Max output

262K

Input

$0.95 / 1M

Output

$4.00 / 1M

reasoningtool-usevisionfile-inputimplicit-caching

Morph V3 Fast

morph/morph-v3-fast

language

Provider

morph

Context

81.9K

Max output

16.4K

Input

$0.80 / 1M

Output

$1.20 / 1M

Morph V3 Large

morph/morph-v3-large

language

Provider

morph

Context

81.9K

Max output

16.4K

Input

$0.90 / 1M

Output

$1.90 / 1M

Nemotron 3 Nano 30B A3B

nvidia/nemotron-3-nano-30b-a3b

language

Provider

nvidia

Context

262.1K

Max output

262.1K

Input

$0.05 / 1M

Output

$0.24 / 1M

reasoning

NVIDIA Nemotron 3 Super 120B A12B

nvidia/nemotron-3-super-120b-a12b

language

Provider

nvidia

Context

256K

Max output

32K

Input

$0.15 / 1M

Output

$0.65 / 1M

Nvidia Nemotron Nano 12B V2 VL

nvidia/nemotron-nano-12b-v2-vl

language

Provider

nvidia

Context

131.1K

Max output

131.1K

Input

$0.20 / 1M

Output

$0.60 / 1M

visionreasoningtool-usetool use

Nvidia Nemotron Nano 9B V2

nvidia/nemotron-nano-9b-v2

language

Provider

nvidia

Context

131.1K

Max output

131.1K

Input

$0.06 / 1M

Output

$0.23 / 1M

reasoningtool-usetool use

GPT-3.5 Turbo

openai/gpt-3.5-turbo

language

Provider

openai

Context

16.4K

Max output

4.1K

Input

$0.50 / 1M

Output

$1.50 / 1M

GPT-3.5 Turbo Instruct

openai/gpt-3.5-turbo-instruct

language

Provider

openai

Context

8.2K

Max output

4.1K

Input

$1.50 / 1M

Output

$2.00 / 1M

GPT-4 Turbo

openai/gpt-4-turbo

language

Provider

openai

Context

128K

Max output

4.1K

Input

$10.00 / 1M

Output

$30.00 / 1M

tool-usevisiontool use

GPT-4.1

openai/gpt-4.1

language

Provider

openai

Context

1M

Max output

32.8K

Input

$2.00 / 1M

Output

$8.00 / 1M

file-inputtool-usevisionimplicit-cachingweb-search

GPT-4.1 mini

openai/gpt-4.1-mini

language

Provider

openai

Context

1M

Max output

32.8K

Input

$0.40 / 1M

Output

$1.60 / 1M

file-inputtool-usevisionimplicit-cachingweb-search

GPT-4.1 nano

openai/gpt-4.1-nano

language

Provider

openai

Context

1M

Max output

32.8K

Input

$0.10 / 1M

Output

$0.40 / 1M

file-inputtool-usevisionimplicit-cachingweb-search

GPT-4o

openai/gpt-4o

language

Provider

openai

Context

128K

Max output

16.4K

Input

$2.50 / 1M

Output

$10.00 / 1M

file-inputtool-usevisionimplicit-cachingweb-search

GPT-4o mini

openai/gpt-4o-mini

language

Provider

openai

Context

128K

Max output

16.4K

Input

$0.15 / 1M

Output

$0.60 / 1M

file-inputtool-usevisionimplicit-cachingweb-search

GPT 4o Mini Search Preview

openai/gpt-4o-mini-search-preview

language

Provider

openai

Context

128K

Max output

16.4K

Input

$0.15 / 1M

Output

$0.60 / 1M

web-searchweb search

GPT-5

openai/gpt-5

language

Provider

openai

Context

400K

Max output

128K

Input

$1.25 / 1M

Output

$10.00 / 1M

file-inputimplicit-cachingreasoningtool-usevision

GPT 5 Chat

openai/gpt-5-chat

language

Provider

openai

Context

128K

Max output

16.4K

Input

$1.25 / 1M

Output

$10.00 / 1M

tool-useimplicit-cachingfile-inputimage-generationvision

GPT-5-Codex

openai/gpt-5-codex

language

Provider

openai

Context

400K

Max output

128K

Input

$1.25 / 1M

Output

$10.00 / 1M

file-inputimplicit-cachingreasoningtool-useweb-search

GPT-5 mini

openai/gpt-5-mini

language

Provider

openai

Context

400K

Max output

128K

Input

$0.25 / 1M

Output

$2.00 / 1M

file-inputimplicit-cachingreasoningtool-usevision

GPT-5 nano

openai/gpt-5-nano

language

Provider

openai

Context

400K

Max output

128K

Input

$0.05 / 1M

Output

$0.40 / 1M

file-inputimplicit-cachingreasoningtool-usevision

GPT-5 pro

openai/gpt-5-pro

language

Provider

openai

Context

400K

Max output

272K

Input

$15.00 / 1M

Output

$120.00 / 1M

file-inputimplicit-cachingreasoningtool-usevision

GPT-5.1-Codex

openai/gpt-5.1-codex

language

Provider

openai

Context

400K

Max output

128K

Input

$1.25 / 1M

Output

$10.00 / 1M

file-inputtool-usereasoningvisionweb-search

GPT 5.1 Codex Max

openai/gpt-5.1-codex-max

language

Provider

openai

Context

400K

Max output

128K

Input

$1.25 / 1M

Output

$10.00 / 1M

reasoningfile-inputtool-usevisionweb-search

GPT 5.1 Codex Mini

openai/gpt-5.1-codex-mini

language

Provider

openai

Context

400K

Max output

128K

Input

$0.25 / 1M

Output

$2.00 / 1M

reasoningfile-inputvisiontool-useweb-search

GPT-5.1 Instant

openai/gpt-5.1-instant

language

Provider

openai

Context

128K

Max output

16.4K

Input

$1.25 / 1M

Output

$10.00 / 1M

tool-usevisionfile-inputreasoningimplicit-caching

GPT 5.1 Thinking

openai/gpt-5.1-thinking

language

Provider

openai

Context

400K

Max output

128K

Input

$1.25 / 1M

Output

$10.00 / 1M

tool-useimplicit-cachingfile-inputreasoningvision

GPT 5.2

openai/gpt-5.2

language

Provider

openai

Context

400K

Max output

128K

Input

$1.75 / 1M

Output

$14.00 / 1M

tool-usevisionfile-inputreasoningimplicit-caching

GPT 5.2 Chat

openai/gpt-5.2-chat

language

Provider

openai

Context

128K

Max output

16.4K

Input

$1.75 / 1M

Output

$14.00 / 1M

visionfile-inputtool-usereasoningimplicit-caching

GPT 5.2 Codex

openai/gpt-5.2-codex

language

Provider

openai

Context

400K

Max output

128K

Input

$1.75 / 1M

Output

$14.00 / 1M

file-inputtool-usereasoningvisionweb-search

GPT 5.2

openai/gpt-5.2-pro

language

Provider

openai

Context

400K

Max output

128K

Input

$21.00 / 1M

Output

$168.00 / 1M

tool-usevisionimplicit-cachingreasoningfile-input

GPT-5.3 Chat

openai/gpt-5.3-chat

language

Provider

openai

Context

128K

Max output

16.4K

Input

$1.75 / 1M

Output

$14.00 / 1M

visionfile-inputtool-usereasoningimplicit-caching

GPT 5.3 Codex

openai/gpt-5.3-codex

language

Provider

openai

Context

400K

Max output

128K

Input

$1.75 / 1M

Output

$14.00 / 1M

reasoningtool-usefile-inputvisionweb-search

GPT 5.4

openai/gpt-5.4

language

Provider

openai

Context

1.1M

Max output

128K

Input

$2.50 / 1M

Output

$15.00 / 1M

reasoningtool-usevisionfile-inputimplicit-caching

GPT 5.4 Mini

openai/gpt-5.4-mini

language

Provider

openai

Context

400K

Max output

128K

Input

$0.75 / 1M

Output

$4.50 / 1M

reasoningtool-usevisionfile-inputimplicit-caching

GPT 5.4 Nano

openai/gpt-5.4-nano

language

Provider

openai

Context

400K

Max output

128K

Input

$0.20 / 1M

Output

$1.25 / 1M

reasoningtool-useimplicit-cachingweb-searchvision

GPT 5.4 Pro

openai/gpt-5.4-pro

language

Provider

openai

Context

1.1M

Max output

128K

Input

$30.00 / 1M

Output

$180.00 / 1M

reasoningtool-usevisionfile-inputimplicit-caching

GPT 5.5

openai/gpt-5.5

language

Provider

openai

Context

1M

Max output

128K

Input

$5.00 / 1M

Output

$30.00 / 1M

reasoningtool-useweb-searchimplicit-cachingfile-input

GPT 5.5 Pro

openai/gpt-5.5-pro

language

Provider

openai

Context

1M

Max output

128K

Input

$30.00 / 1M

Output

$180.00 / 1M

reasoningtool-useimplicit-cachingfile-inputweb-search

GPT Image 1

openai/gpt-image-1

image

Provider

openai

Context

Max output

Input

$5.00 / 1M

Output

$40.00 / 1M

image-generation

GPT Image 1 Mini

openai/gpt-image-1-mini

image

Provider

openai

Context

Max output

Input

$2.00 / 1M

Output

$8.00 / 1M

image-generation

GPT Image 1.5

openai/gpt-image-1.5

image

Provider

openai

Context

Max output

Input

$5.00 / 1M

Output

$32.00 / 1M

image-generation

GPT Image 2

openai/gpt-image-2

image

Provider

openai

Context

Max output

Input

$5.00 / 1M

Output

$30.00 / 1M

image-generation

GPT OSS 120B

openai/gpt-oss-120b

language

Provider

openai

Context

131.1K

Max output

131K

Input

$0.35 / 1M

Output

$0.75 / 1M

implicit-caching

GPT OSS 20B

openai/gpt-oss-20b

language

Provider

openai

Context

131.1K

Max output

8.2K

Input

$0.05 / 1M

Output

$0.20 / 1M

reasoningtool-usetool use

GPT OSS Safeguard 20B

openai/gpt-oss-safeguard-20b

language

Provider

openai

Context

131.1K

Max output

65.5K

Input

$0.075 / 1M

Output

$0.30 / 1M

reasoningtool-usetool use

o1

openai/o1

language

Provider

openai

Context

200K

Max output

100K

Input

$15.00 / 1M

Output

$60.00 / 1M

file-inputreasoningtool-usevisionimplicit-caching

o3

openai/o3

language

Provider

openai

Context

200K

Max output

100K

Input

$2.00 / 1M

Output

$8.00 / 1M

file-inputreasoningtool-usevisionimplicit-caching

o3-deep-research

openai/o3-deep-research

language

Provider

openai

Context

200K

Max output

100K

Input

$10.00 / 1M

Output

$40.00 / 1M

reasoningfile-inputtool-usevisionimplicit-caching

o3-mini

openai/o3-mini

language

Provider

openai

Context

200K

Max output

100K

Input

$1.10 / 1M

Output

$4.40 / 1M

reasoningtool-useimplicit-cachingtool use

o3 Pro

openai/o3-pro

language

Provider

openai

Context

200K

Max output

100K

Input

$20.00 / 1M

Output

$80.00 / 1M

reasoningvisionfile-inputtool-useweb-search

o4-mini

openai/o4-mini

language

Provider

openai

Context

200K

Max output

100K

Input

$1.10 / 1M

Output

$4.40 / 1M

file-inputreasoningtool-usevisionimplicit-caching

text-embedding-3-large

openai/text-embedding-3-large

embedding

Provider

openai

Context

Max output

Input

$0.13 / 1M

Output

text-embedding-3-small

openai/text-embedding-3-small

embedding

Provider

openai

Context

Max output

Input

$0.02 / 1M

Output

text-embedding-ada-002

openai/text-embedding-ada-002

embedding

Provider

openai

Context

Max output

Input

$0.10 / 1M

Output

Sonar

perplexity/sonar

language

Provider

perplexity

Context

127K

Max output

8K

Input

Output

tool-usevisiontool use

Sonar Pro

perplexity/sonar-pro

language

Provider

perplexity

Context

200K

Max output

8K

Input

Output

tool-usevisiontool use

Sonar Reasoning Pro

perplexity/sonar-reasoning-pro

language

Provider

perplexity

Context

127K

Max output

8K

Input

Output

reasoning

Flux Schnell

prodia/flux-fast-schnell

image

Provider

prodia

Context

512

Max output

Input

Output

image-generation

Recraft V2

recraft/recraft-v2

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Recraft V3

recraft/recraft-v3

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Recraft V4

recraft/recraft-v4

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Recraft V4 Pro

recraft/recraft-v4-pro

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Recraft V4.1

recraft/recraft-v4.1

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Recraft V4.1 Pro

recraft/recraft-v4.1-pro

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Recraft V4.1 Utility

recraft/recraft-v4.1-utility

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Recraft V4.1 Utility Pro

recraft/recraft-v4.1-utility-pro

image

Provider

recraft

Context

Max output

Input

Output

image-generation

Voyage Rerank 2.5

voyage/rerank-2.5

reranking

Provider

voyage

Context

32K

Max output

32K

Input

$0.05 / 1M

Output

Voyage Rerank 2.5 Lite

voyage/rerank-2.5-lite

reranking

Provider

voyage

Context

32K

Max output

32K

Input

$0.02 / 1M

Output

voyage-3-large

voyage/voyage-3-large

embedding

Provider

voyage

Context

Max output

Input

$0.18 / 1M

Output

Voyage 3.5

voyage/voyage-3.5

embedding

Provider

voyage

Context

Max output

Input

$0.06 / 1M

Output

Voyage 3.5 Lite

voyage/voyage-3.5-lite

embedding

Provider

voyage

Context

Max output

Input

$0.02 / 1M

Output

Voyage 4

voyage/voyage-4

embedding

Provider

voyage

Context

32K

Max output

Input

$0.06 / 1M

Output

Voyage 4 Large

voyage/voyage-4-large

embedding

Provider

voyage

Context

32K

Max output

Input

$0.12 / 1M

Output

Voyage 4 Lite

voyage/voyage-4-lite

embedding

Provider

voyage

Context

32K

Max output

Input

$0.02 / 1M

Output

Voyage Code 2

voyage/voyage-code-2

embedding

Provider

voyage

Context

Max output

Input

$0.12 / 1M

Output

Voyage Code 3

voyage/voyage-code-3

embedding

Provider

voyage

Context

Max output

Input

$0.18 / 1M

Output

Voyage Finance 2

voyage/voyage-finance-2

embedding

Provider

voyage

Context

Max output

Input

$0.12 / 1M

Output

Voyage Law 2

voyage/voyage-law-2

embedding

Provider

voyage

Context

Max output

Input

$0.12 / 1M

Output

Grok 4.1 Fast Non-Reasoning

xai/grok-4.1-fast-non-reasoning

language

Provider

xai

Context

1M

Max output

1M

Input

$0.20 / 1M

Output

$0.50 / 1M

tool-usefile-inputvisionimplicit-cachingfile input

Grok 4.1 Fast Reasoning

xai/grok-4.1-fast-reasoning

language

Provider

xai

Context

1M

Max output

1M

Input

$0.20 / 1M

Output

$0.50 / 1M

reasoningfile-inputvisiontool-useimplicit-caching

Grok 4.20 Multi-Agent

xai/grok-4.20-multi-agent

language

Provider

xai

Context

2M

Max output

2M

Input

$1.25 / 1M

Output

$2.50 / 1M

reasoningtool-useimplicit-cachingvisionfile-input

Grok 4.20 Multi Agent Beta

xai/grok-4.20-multi-agent-beta

language

Provider

xai

Context

2M

Max output

2M

Input

$1.25 / 1M

Output

$2.50 / 1M

reasoningtool-useimplicit-cachingvisionfile-input

Grok 4.20 Non-Reasoning

xai/grok-4.20-non-reasoning

language

Provider

xai

Context

2M

Max output

2M

Input

$1.25 / 1M

Output

$2.50 / 1M

tool-useimplicit-cachingvisionfile-inputweb-search

Grok 4.20 Beta Non-Reasoning

xai/grok-4.20-non-reasoning-beta

language

Provider

xai

Context

2M

Max output

2M

Input

$1.25 / 1M

Output

$2.50 / 1M

tool-useimplicit-cachingvisionfile-inputweb-search

Grok 4.20 Reasoning

xai/grok-4.20-reasoning

language

Provider

xai

Context

2M

Max output

2M

Input

$1.25 / 1M

Output

$2.50 / 1M

reasoningvisiontool-usefile-inputimplicit-caching

Grok 4.20 Beta Reasoning

xai/grok-4.20-reasoning-beta

language

Provider

xai

Context

2M

Max output

2M

Input

$1.25 / 1M

Output

$2.50 / 1M

reasoningtool-usevisionfile-inputimplicit-caching

Grok 4.3

xai/grok-4.3

language

Provider

xai

Context

1M

Max output

1M

Input

$1.25 / 1M

Output

$2.50 / 1M

reasoningtool-useimplicit-cachingfile-inputvision

Grok Imagine Image

xai/grok-imagine-image

image

Provider

xai

Context

Max output

Input

Output

image-generation

Grok Imagine

xai/grok-imagine-video

video

Provider

xai

Context

Max output

Input

Output

MiMo V2 Flash

xiaomi/mimo-v2-flash

language

Provider

xiaomi

Context

262.1K

Max output

32K

Input

$0.10 / 1M

Output

$0.30 / 1M

reasoningtool-usetool use

MiMo V2 Pro

xiaomi/mimo-v2-pro

language

Provider

xiaomi

Context

1M

Max output

128K

Input

$1.00 / 1M

Output

$3.00 / 1M

reasoningtool-usetool use

MiMo M2.5

xiaomi/mimo-v2.5

language

Provider

xiaomi

Context

1.1M

Max output

131.1K

Input

$0.40 / 1M

Output

$2.00 / 1M

reasoningtool-useimplicit-cachingfile-inputvision

MiMo V2.5 Pro

xiaomi/mimo-v2.5-pro

language

Provider

xiaomi

Context

1.1M

Max output

131K

Input

$1.00 / 1M

Output

$3.00 / 1M

reasoningtool-usevisionfile-inputimplicit-caching

GLM-4.5

zai/glm-4.5

language

Provider

zai

Context

128K

Max output

96K

Input

$0.60 / 1M

Output

$2.20 / 1M

reasoningtool-useimplicit-cachingtool use

GLM 4.5 Air

zai/glm-4.5-air

language

Provider

zai

Context

128K

Max output

96K

Input

$0.20 / 1M

Output

$1.10 / 1M

reasoningtool-useimplicit-cachingtool use

GLM 4.5V

zai/glm-4.5v

language

Provider

zai

Context

66K

Max output

16K

Input

$0.60 / 1M

Output

$1.80 / 1M

tool-usevisionimplicit-cachingtool use

GLM 4.6

zai/glm-4.6

language

Provider

zai

Context

200K

Max output

96K

Input

$0.60 / 1M

Output

$2.20 / 1M

reasoningtool-useimplicit-cachingtool use

GLM-4.6V

zai/glm-4.6v

language

Provider

zai

Context

128K

Max output

24K

Input

$0.30 / 1M

Output

$0.90 / 1M

visionfile-inputreasoningtool-useimplicit-caching

GLM-4.6V-Flash

zai/glm-4.6v-flash

language

Provider

zai

Context

128K

Max output

24K

Input

Output

visionreasoningfile-inputtool-useimplicit-caching

GLM 4.7

zai/glm-4.7

language

Provider

zai

Context

131K

Max output

40K

Input

$2.25 / 1M

Output

$2.75 / 1M

reasoningtool-useimplicit-cachingtool use

GLM 4.7 Flash

zai/glm-4.7-flash

language

Provider

zai

Context

200K

Max output

131K

Input

$0.07 / 1M

Output

$0.40 / 1M

reasoningtool-usetool use

GLM 4.7 FlashX

zai/glm-4.7-flashx

language

Provider

zai

Context

200K

Max output

128K

Input

$0.06 / 1M

Output

$0.40 / 1M

reasoningtool-useimplicit-cachingtool use

GLM 5

zai/glm-5

language

Provider

zai

Context

202.8K

Max output

131.1K

Input

$1.00 / 1M

Output

$3.20 / 1M

reasoningtool-useimplicit-cachingtool use

GLM 5 Turbo

zai/glm-5-turbo

language

Provider

zai

Context

202.8K

Max output

131.1K

Input

$1.20 / 1M

Output

$4.00 / 1M

reasoningtool-useimplicit-cachingtool use

GLM 5.1

zai/glm-5.1

language

Provider

zai

Context

202.8K

Max output

64K

Input

$1.40 / 1M

Output

$4.40 / 1M

reasoningtool-useimplicit-cachingtool use

GLM 5V Turbo

zai/glm-5v-turbo

language

Provider

zai

Context

200K

Max output

128K

Input

$1.20 / 1M

Output

$4.00 / 1M

reasoningtool-useimplicit-cachingvisionfile-input

Model data for planning, not final procurement

Model data is read from Supabase AI Gateway snapshots generated by the ingestion pipeline. Pricing and availability may change, so verify with official providers before production use.

Back to AI Developer Tools