✨ Updated on August 3rd, 2025 - Latest LLM data and pricing

Total Models: 512
Avg Price: $2.47
Providers: 70
Creators: 21
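
As a rough illustration of how summary figures like those above could be derived from the listing, the sketch below computes a row count, an average price, and distinct provider and creator counts. The tuple layout, the `summarize` and `filter_rows` helpers, and the choice of sample rows are assumptions made for this example; the four rows are copied from the table in this document, and the page's actual computation is not known.

```python
from statistics import mean

# Sample rows copied from the table in this document.
# Assumed layout: (model, creator, provider, quality_index, price_per_1m_tokens_usd)
rows = [
    ("Aya Expanse 32B", "Cohere", "Cohere", 11, 0.75),
    ("Claude 3 Opus", "Anthropic", "Amazon Bedrock", 48, 30.00),
    ("Claude 3 Opus", "Anthropic", "Anthropic", 48, 30.00),
    ("Grok 4", "xAI", "xAI", 68, 6.00),
]


def summarize(rows):
    """Hypothetical helper: count rows, average the price, count distinct names."""
    return {
        "total_models": len(rows),  # one entry per model/provider listing
        "avg_price": round(mean(r[4] for r in rows), 2),
        "providers": len({r[2] for r in rows}),
        "creators": len({r[1] for r in rows}),
    }


def filter_rows(rows, min_quality=0, max_price=float("inf")):
    """Hypothetical helper: keep rows at or above a quality index and at or below a price."""
    return [r for r in rows if r[3] >= min_quality and r[4] <= max_price]


summary = summarize(rows)
affordable_and_strong = filter_rows(rows, min_quality=40, max_price=10.0)
```

With only these four sample rows the numbers will not match the page's totals; the point is the shape of the calculation, not the values.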

Interactive Model Comparison (512 models)

Detailed Model Data

512 Results
Sorted by model (ascending)
| Model Name | Creator | Provider | Quality Index | Context Window | Price/1M Tokens | Speed (tokens/s) | Latency (s) |
|---|---|---|---|---|---|---|---|
| Aya Expanse 32B | Cohere | Cohere | 11 | 128,000 | $0.75 | 118.8 | 0.17s |
| Aya Expanse 8B | Cohere | Cohere | 7 | 8,000 | $0.75 | 167.3 | 0.21s |
| Claude 2.1 | Anthropic | Amazon Bedrock | 24 | 200,000 | $12.00 | 29.6 | 1.81s |
| Claude 2.1 | Anthropic | Anthropic | 24 | 200,000 | $12.00 | 13.7 | 0.92s |
| Claude 3 Haiku | Anthropic | Amazon Bedrock | 12 | 200,000 | $0.50 | 113.8 | 0.85s |
| Claude 3 Haiku | Anthropic | Anthropic | 12 | 200,000 | $0.50 | 137.7 | 0.39s |
| Claude 3 Opus | Anthropic | Amazon Bedrock | 48 | 200,000 | $30.00 | 27.1 | 1.20s |
| Claude 3 Opus | Anthropic | Google Vertex | 48 | 200,000 | $30.00 | 21.2 | 2.42s |
| Claude 3 Opus | Anthropic | Anthropic | 48 | 200,000 | $30.00 | 28.3 | 1.15s |
| Claude 3 Sonnet | Anthropic | Amazon Bedrock | 19 | 200,000 | $6.00 | 64.8 | 0.74s |
| Claude 3 Sonnet | Anthropic | Anthropic | 19 | 200,000 | $6.00 | 59.4 | 0.56s |
| Claude 3.5 Haiku | Anthropic | Amazon Bedrock (Standard) | 26 | 200,000 | $1.60 | 54.4 | 1.32s |
| Claude 3.5 Haiku | Anthropic | Amazon Bedrock (Latency Optimized) | 26 | 200,000 | $2.00 | 92.8 | 0.51s |
| Claude 3.5 Haiku | Anthropic | Google Vertex | 26 | 200,000 | $1.60 | 65.9 | 0.76s |
| Claude 3.5 Haiku | Anthropic | Anthropic | 26 | 200,000 | $1.60 | 64.1 | 1.34s |
| Claude 3.5 Sonnet (June) | Anthropic | Google Vertex | 29 | 200,000 | $6.00 | 80.0 | 0.84s |
| Claude 3.5 Sonnet (Oct) | Anthropic | Amazon Bedrock | 36 | 200,000 | $6.00 | 49.9 | 0.95s |
| Claude 3.5 Sonnet (Oct) | Anthropic | Google Vertex | 36 | 200,000 | $6.00 | 79.6 | 0.87s |
| Claude 3.5 Sonnet (Oct) | Anthropic | Anthropic | 36 | 200,000 | $6.00 | 77.8 | 1.41s |
| Claude 3.7 Sonnet | Anthropic | Amazon Bedrock | 37 | 200,000 | $6.00 | 53.0 | 1.05s |
| Claude 3.7 Sonnet | Anthropic | Google Vertex | 37 | 200,000 | $6.00 | 78.1 | 0.91s |
| Claude 3.7 Sonnet | Anthropic | Anthropic | 37 | 200,000 | $6.00 | 78.7 | 1.19s |
| Claude 3.7 Sonnet Thinking | Anthropic | Amazon Bedrock | 57 | 200,000 | $6.00 | 76.3 | 1.48s |
| Claude 3.7 Sonnet Thinking | Anthropic | Anthropic | 57 | 200,000 | $6.00 | 88.3 | 1.37s |
| Claude 4 Opus | Anthropic | Amazon Bedrock | 48 | 200,000 | $30.00 | 22.4 | 3.64s |
| Claude 4 Opus | Anthropic | Google Vertex | 48 | 200,000 | $30.00 | 63.2 | 1.62s |
| Claude 4 Opus | Anthropic | Anthropic | 48 | 200,000 | $30.00 | 63.8 | 1.84s |
| Claude 4 Opus Thinking | Anthropic | Amazon Bedrock | 58 | 200,000 | $30.00 | 19.1 | 3.47s |
| Claude 4 Opus Thinking | Anthropic | Google Vertex | 58 | 200,000 | $30.00 | 59.2 | 1.62s |
| Claude 4 Opus Thinking | Anthropic | Anthropic | 58 | 200,000 | $30.00 | 65.5 | 1.83s |
| Claude 4 Sonnet | Anthropic | Amazon Bedrock | 46 | 200,000 | $6.00 | 63.8 | 1.30s |
| Claude 4 Sonnet | Anthropic | Google Vertex | 46 | 200,000 | $6.00 | 83.8 | 1.25s |
| Claude 4 Sonnet | Anthropic | Anthropic | 46 | 200,000 | $6.00 | 80.3 | 1.63s |
| Claude 4 Sonnet Thinking | Anthropic | Amazon Bedrock | 59 | 200,000 | $6.00 | 44.0 | 1.17s |
| Claude 4 Sonnet Thinking | Anthropic | Google Vertex | 59 | 200,000 | $6.00 | 72.8 | 1.17s |
| Claude 4 Sonnet Thinking | Anthropic | Anthropic | 59 | 200,000 | $6.00 | 85.0 | 1.44s |
| Codestral (Jan '25) | Mistral | Mistral | 19 | 256,000 | $0.45 | 163.8 | 0.30s |
| Codestral (Jan '25) | Mistral | Google Vertex | 19 | 128,000 | $0.45 | 150.5 | 0.14s |
| Command A | Cohere | Cohere | 34 | 256,000 | $4.38 | 165.6 | 0.21s |
| Command-R | Cohere | Amazon Bedrock | 15 | 128,000 | $0.75 | 107.6 | 0.34s |
| Command-R | Cohere | Cohere | 15 | 128,000 | $0.26 | 72.1 | 0.19s |
| Command-R (Mar '24) | Cohere | Amazon Bedrock | 15 | 128,000 | $0.75 | 107.2 | 0.33s |
| Command-R (Mar '24) | Cohere | Cohere | 15 | 128,000 | $0.75 | 179.0 | 0.15s |
| Command-R+ | Cohere | Amazon Bedrock | 21 | 128,000 | $6.00 | 48.1 | 0.49s |
| Command-R+ | Cohere | Cohere | 21 | 128,000 | $4.38 | 48.6 | 0.26s |
| Command-R+ (Apr '24) | Cohere | Amazon Bedrock | 20 | 128,000 | $6.00 | 48.1 | 0.49s |
| Command-R+ (Apr '24) | Cohere | Cohere | 20 | 128,000 | $6.00 | 58.8 | 0.22s |
| DeepSeek R1 (Jan '25) | DeepSeek | Lambda | 50 | 164,000 | $0.95 | 39.8 | 0.33s |
| DeepSeek R1 (Jan '25) | DeepSeek | Hyperbolic | 50 | 128,000 | $2.00 | 94.1 | 0.96s |
| DeepSeek R1 (Jan '25) | DeepSeek | Amazon Bedrock | 50 | 128,000 | $2.36 | 229.3 | 0.37s |
| DeepSeek R1 (Jan '25) | DeepSeek | Nebius (Base) | 50 | 128,000 | $1.20 | 35.0 | 0.62s |
| DeepSeek R1 (Jan '25) | DeepSeek | Nebius (Fast) | 50 | 128,000 | $3.00 | 80.9 | 0.65s |
| DeepSeek R1 (Jan '25) | DeepSeek | Microsoft Azure | 50 | 128,000 | $2.36 | 120.8 | 0.78s |
| DeepSeek R1 (Jan '25) | DeepSeek | Fireworks (Fast) | 50 | 164,000 | $4.25 | 111.1 | 0.55s |
| DeepSeek R1 (Jan '25) | DeepSeek | Fireworks (Base) | 50 | 164,000 | $0.96 | 90.4 | 0.50s |
| DeepSeek R1 (Jan '25) | DeepSeek | Deepinfra (Turbo, FP4) | 50 | 33,000 | $1.50 | 207.2 | 0.24s |
| DeepSeek R1 (Jan '25) | DeepSeek | Deepinfra | 50 | 64,000 | $0.88 | 122.2 | 0.26s |
| DeepSeek R1 (Jan '25) | DeepSeek | FriendliAI | 50 | 128,000 | $4.00 | 94.2 | 0.48s |
| DeepSeek R1 (Jan '25) | DeepSeek | Novita (Turbo) | 50 | 64,000 | $1.15 | 32.6 | 0.79s |
| DeepSeek R1 (Jan '25) | DeepSeek | Novita | 50 | 64,000 | $4.00 | 32.8 | 0.76s |
| DeepSeek R1 (Jan '25) | DeepSeek | Together.ai | 50 | 128,000 | $4.00 | 335.1 | 0.68s |
| DeepSeek R1 (Jan '25) | DeepSeek | kluster.ai | 50 | 128,000 | $3.50 | 76.8 | 0.52s |
| DeepSeek R1 0528 | DeepSeek | Lambda | 59 | 128,000 | $0.96 | 22.2 | 3.39s |
| DeepSeek R1 0528 | DeepSeek | DeepSeek | 59 | 128,000 | $0.96 | 22.2 | 3.39s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Parasail | 59 | 164,000 | $1.59 | 109.8 | 0.45s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Hyperbolic | 59 | 164,000 | $3.00 | 105.2 | 0.99s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Nebius AI Studio base | 59 | 164,000 | $1.00 | 103.1 | 0.61s |
| DeepSeek R1 0528 (May '25) | DeepSeek | CentML | 59 | 64,000 | $0.00 | 87.5 | 0.82s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Microsoft Azure | 59 | 128,000 | $2.36 | 119.0 | 0.75s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Fireworks (Fast) | 59 | 164,000 | $4.25 | 267.1 | 0.47s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Deepinfra | 59 | 164,000 | $0.91 | 84.2 | 0.28s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Novita | 59 | 128,000 | $1.15 | 117.3 | 0.56s |
| DeepSeek R1 0528 (May '25) | DeepSeek | GMI | 59 | 131,000 | $1.18 | 63.6 | 0.61s |
| DeepSeek R1 0528 (May '25) | DeepSeek | SambaNova | 59 | 33,000 | $5.50 | 131.8 | 4.57s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Together.ai | 59 | 128,000 | $4.00 | 338.5 | 0.69s |
| DeepSeek R1 0528 (May '25) | DeepSeek | Together.ai (Throughput) | 59 | 128,000 | $0.96 | 24.2 | 1.92s |
| DeepSeek R1 0528 (May '25) | DeepSeek | kluster.ai | 59 | 164,000 | $3.50 | 80.0 | 0.52s |
| DeepSeek R1 0528 (May '25) (Vertex) | DeepSeek | Google (Vertex) | 59 | 128,000 | $0.00 | 121.3 | 0.38s |
| DeepSeek R1 0528 Qwen3 8B | DeepSeek | Parasail | 44 | 131,000 | $0.06 | 124.9 | 0.41s |
| DeepSeek R1 0528 Qwen3 8B | DeepSeek | Novita | 44 | 128,000 | $0.07 | 90.7 | 0.79s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | Lambda | 37 | 128,000 | $0.30 | 64.6 | 0.28s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | Cerebras | 37 | 66,000 | $0.94 | 2473.0 | 0.21s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | Nebius (Base) | 37 | 128,000 | $0.38 | 59.5 | 0.56s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | Deepinfra | 37 | 128,000 | $0.17 | 32.5 | 0.38s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | Novita | 37 | 32,000 | $0.80 | 32.5 | 0.59s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | GMI | 37 | 24,000 | $0.38 | 35.9 | 0.82s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | Groq | 37 | 128,000 | $0.81 | 368.3 | 0.17s |
| DeepSeek R1 Distill Llama 70B | DeepSeek | SambaNova | 37 | 131,000 | $0.88 | 315.5 | 1.67s |
| DeepSeek R1 Distill Llama 8B | DeepSeek | Novita | 25 | 32,000 | $0.04 | 55.9 | 0.74s |
| DeepSeek R1 Distill Qwen 14B | DeepSeek | Novita | 38 | 64,000 | $0.15 | 44.0 | 1.24s |
| DeepSeek R1 Distill Qwen 14B | DeepSeek | GMI | 38 | 131,000 | $0.20 | 82.8 | 0.63s |
| DeepSeek R1 Distill Qwen 14B | DeepSeek | Together.ai | 38 | 128,000 | $1.60 | 166.5 | 0.26s |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | Deepinfra | 41 | 128,000 | $0.09 | 31.8 | 0.30s |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | Novita | 41 | 64,000 | $0.30 | 21.2 | 1.34s |
| DeepSeek V3 (Dec '24) | DeepSeek | Hyperbolic (FP8) | 35 | 128,000 | $0.25 | 31.0 | 1.42s |
| DeepSeek V3 (Dec '24) | DeepSeek | Nebius | 35 | 128,000 | $0.75 | 33.8 | 0.61s |
| DeepSeek V3 (Dec '24) | DeepSeek | Microsoft Azure | 35 | 128,000 | $2.00 | 74.9 | 0.53s |
| DeepSeek V3 (Dec '24) | DeepSeek | Fireworks | 35 | 128,000 | $1.31 | 111.3 | 0.65s |
| DeepSeek V3 (Dec '24) | DeepSeek | Deepinfra | 35 | 64,000 | $0.51 | 34.3 | 0.29s |
| DeepSeek V3 (Dec '24) | DeepSeek | Novita (Turbo) | 35 | 64,000 | $0.63 | 30.7 | 1.04s |
| DeepSeek V3 (Dec '24) | DeepSeek | Novita | 35 | 64,000 | $0.89 | 29.9 | 0.79s |
| DeepSeek V3 (Dec '24) | DeepSeek | Together.ai | 35 | 128,000 | $1.25 | 90.9 | 0.61s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Lambda | 44 | 164,000 | $0.47 | 33.5 | 0.55s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | DeepSeek | 44 | 64,000 | $0.48 | 28.7 | 2.91s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Replicate | 44 | 128,000 | $1.45 | 89.0 | 0.73s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Hyperbolic | 44 | 128,000 | $1.25 | 35.3 | 1.20s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Nebius (Fast) | 44 | 128,000 | $3.00 | 94.6 | 0.71s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Nebius | 44 | 128,000 | $0.75 | 27.5 | 0.63s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | CentML | 44 | 164,000 | $0.00 | 85.8 | 0.56s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Microsoft Azure | 44 | 128,000 | $2.00 | 66.4 | 0.53s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Fireworks | 44 | 160,000 | $0.90 | 276.3 | 0.47s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Deepinfra | 44 | 164,000 | $0.43 | 21.5 | 0.35s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Novita | 44 | 128,000 | $0.57 | 29.1 | 1.00s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | GMI | 44 | 131,000 | $0.78 | 164.0 | 0.57s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | SambaNova | 44 | 33,000 | $3.38 | 166.1 | 1.77s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | Together.ai | 44 | 128,000 | $1.25 | 110.5 | 0.63s |
| DeepSeek V3 0324 (Mar '25) | DeepSeek | kluster.ai | 44 | 164,000 | $0.88 | 37.1 | 0.62s |
| Devstral | Mistral | Mistral | 32 | 256,000 | $0.15 | 140.5 | 0.32s |
| Devstral Medium | Mistral | Mistral | 32 | 256,000 | $0.80 | 111.4 | 0.39s |
| Devstral Small | Mistral | Mistral | 23 | 256,000 | $0.15 | 159.5 | 0.32s |
| Devstral Small | Mistral | Nebius AI Studio | 23 | 128,000 | $0.12 | 149.1 | 0.49s |
| Devstral Small | Mistral | Deepinfra | 23 | 128,000 | $0.12 | 99.7 | 0.53s |
| EXAONE 4.0 32B | LG AI Research | FriendliAI | 42 | 131,000 | $0.70 | 86.4 | 0.30s |
| EXAONE 4.0 32B (Reasoning) | LG AI Research | FriendliAI | 56 | 131,000 | $0.70 | 104.1 | 0.32s |
| Gemini 1.5 Flash (Sep) | Google | Google (Vertex) | 39 | 1,000,000 | $0.13 | 189.2 | 0.22s |
| Gemini 1.5 Flash (Sep) | Google | Google (AI Studio) | 39 | 1,000,000 | $0.13 | 193.0 | 0.32s |
| Gemini 1.5 Flash-8B | Google | Google AI Studio | 31 | 1,000,000 | $0.07 | 238.6 | 0.23s |
| Gemini 1.5 Pro | Google | Google | 34 | 1,000,000 | $7.00 | 102.5 | 1.25s |
| Gemini 1.5 Pro (Sep) | Google | Google (Vertex) | 34 | 2,000,000 | $2.19 | 92.5 | 0.80s |
| Gemini 1.5 Pro (Sep) | Google | Google (AI Studio) | 34 | 2,000,000 | $2.19 | 93.6 | 2.25s |
| Gemini 2.0 Flash | Google | Google Vertex | 38 | 1,000,000 | $0.26 | 213.3 | 0.33s |
| Gemini 2.0 Flash | Google | Google (AI Studio) | 38 | 1,000,000 | $0.17 | 242.7 | 0.38s |
| Gemini 2.0 Flash (exp) | Google | Google (AI Studio) | 36 | 1,000,000 | $0.00 | 212.9 | 0.29s |
| Gemini 2.0 Flash-Lite (Feb '25) | Google | Google (AI Studio) | 33 | 1,000,000 | $0.13 | 217.9 | 0.28s |
| Gemini 2.0 Flash-Lite (Preview) | Google | Google (AI Studio) | 30 | 1,000,000 | $0.13 | 219.4 | 0.29s |
| Gemini 2.0 Pro Experimental | Google | Google (AI Studio) | 49 | 2,000,000 | $0.00 | 44.2 | 18.04s |
| Gemini 2.5 Flash | Google | Google (AI Studio) | 47 | 1,000,000 | $0.26 | 284.3 | 0.35s |
| Gemini 2.5 Flash | Google | Google (Vertex) | 47 | 1,000,000 | $0.26 | 201.8 | 0.31s |
| Gemini 2.5 Flash (April '25) | Google | Google (AI Studio) | 49 | 1,000,000 | $0.26 | 317.7 | 0.37s |
| Gemini 2.5 Flash (April '25) (Reasoning) | Google | Google (AI Studio) | 60 | 1,000,000 | $0.99 | 421.9 | 6.58s |
| Gemini 2.5 Flash (Reasoning) | Google | Google (AI Studio) | 58 | 1,000,000 | $0.99 | 356.0 | 9.39s |
| Gemini 2.5 Flash (Reasoning) | Google | Google (Vertex) | 58 | 1,000,000 | $0.99 | 274.5 | 18.05s |
| Gemini 2.5 Flash-Lite | Google | Google (AI Studio) | 35 | 1,000,000 | $0.17 | 472.5 | 0.23s |
| Gemini 2.5 Flash-Lite (Reasoning) | Google | Google (AI Studio) | 46 | 1,000,000 | $0.17 | 697.1 | 5.99s |
| Gemini 2.5 Pro | Google | Google (AI Studio) | 65 | 1,000,000 | $3.44 | 147.3 | 38.01s |
| Gemini 2.5 Pro (Mar '25) | Google | Google | 65 | 1,000,000 | $3.44 | 144.5 | 34.53s |
| Gemini 2.5 Pro (May '25) | Google | Google (AI Studio) | 65 | 1,000,000 | $3.44 | 144.5 | 34.08s |
| Gemini 2.5 Pro (May '25) | Google | Google Vertex | 65 | 1,000,000 | $3.44 | 153.7 | 30.80s |
| Gemma 2 27B | Google | Together.ai | 32 | 8,000 | $0.80 | 92.0 | 0.27s |
| Gemma 2 9B | Google | Nebius (Fast) | 22 | 8,000 | $0.04 | 194.7 | 0.46s |
| Gemma 2 9B | Google | Groq | 22 | 8,000 | $0.20 | 725.7 | 0.20s |
| Gemma 3 12B | Google | Deepinfra | 26 | 128,000 | $0.06 | 63.7 | 0.53s |
| Gemma 3 27B | Google | Parasail | 29 | 131,000 | $0.29 | 80.3 | 0.46s |
| Gemma 3 27B | Google | Google (AI Studio) | 29 | 128,000 | $0.00 | 23.3 | 0.69s |
| Gemma 3 27B | Google | Deepinfra | 29 | 128,000 | $0.12 | 28.6 | 0.52s |
| Gemma 3 4B | Google | Deepinfra | 19 | 128,000 | $0.03 | 134.3 | 0.23s |
| Gemma 3n E2B (AI Studio) | Google | Google (AI Studio) | 13 | 32,000 | $0.00 | 50.8 | 0.24s |
| Gemma 3n E4B | Google | Together.ai | 28 | 33,000 | $0.03 | 51.0 | 0.29s |
| GLM-4.5 | Z AI | SiliconFlow | 52 | 128,000 | $0.96 | 66.5 | 0.70s |
| GLM-4.5 | Z AI | Fireworks | 52 | 131,000 | $0.96 | 112.1 | 0.47s |
| GLM-4.5 | Z AI | Deepinfra | 52 | 128,000 | $0.91 | 57.4 | 0.65s |
| GLM-4.5 | Z AI | Novita | 52 | 131,000 | $1.00 | 43.5 | 0.77s |
| GLM-4.5 (FP8) | Z AI | Parasail | 52 | 131,000 | $0.97 | 115.4 | 0.45s |
| GLM-4.5-Air | Z AI | SiliconFlow | 51 | 128,000 | $0.42 | 164.0 | 0.46s |
| GLM-4.5-Air | Z AI | Deepinfra | 51 | 128,000 | $0.42 | 166.3 | 0.24s |
| GLM-4.5-Air | Z AI | Together.ai | 51 | 131,000 | $0.42 | 247.1 | 0.33s |
| GPT-3.5 Turbo | OpenAI | OpenAI | 11 | 4,000 | $0.75 | 83.5 | 0.36s |
| GPT-4 | OpenAI | OpenAI | 25 | 8,000 | $37.50 | 33.1 | 0.74s |
| GPT-4 Turbo | OpenAI | OpenAI | 28 | 128,000 | $15.00 | 50.9 | 0.89s |
| GPT-4 Turbo | OpenAI | Microsoft Azure | 28 | 128,000 | $15.00 | 53.0 | 1.42s |
| GPT-4.1 | OpenAI | OpenAI | 45 | 1,000,000 | $3.50 | 133.6 | 0.50s |
| GPT-4.1 | OpenAI | Microsoft Azure | 45 | 1,000,000 | $3.50 | 209.3 | 1.08s |
| GPT-4.1 mini | OpenAI | OpenAI | 44 | 1,000,000 | $0.70 | 73.1 | 0.48s |
| GPT-4.1 mini | OpenAI | Microsoft Azure | 44 | 1,000,000 | $0.70 | 217.8 | 0.57s |
| GPT-4.1 nano | OpenAI | OpenAI | 30 | 1,000,000 | $0.17 | 190.2 | 0.31s |
| GPT-4.1 nano | OpenAI | Microsoft Azure | 30 | 1,000,000 | $0.17 | 225.2 | 0.69s |
| GPT-4.5 (Preview) | OpenAI | OpenAI | 42 | 128,000 | $93.75 | 71.1 | 1.00s |
| GPT-4o (Aug '24) | OpenAI | OpenAI | 29 | 128,000 | $4.38 | 119.5 | 0.48s |
| GPT-4o (Aug '24) | OpenAI | Microsoft Azure | 29 | 128,000 | $4.38 | 134.4 | 0.67s |
| GPT-4o (March 2025) | OpenAI | OpenAI | 40 | 128,000 | $7.50 | 196.8 | 0.44s |
| GPT-4o (May '24) | OpenAI | OpenAI | 30 | 128,000 | $7.50 | 115.7 | 0.52s |
| GPT-4o (May '24) | OpenAI | Microsoft Azure | 30 | 128,000 | $7.50 | 153.3 | 0.64s |
| GPT-4o (Nov '24) | OpenAI | OpenAI | 36 | 128,000 | $4.38 | 199.9 | 0.38s |
| GPT-4o (Nov '24) | OpenAI | Microsoft Azure | 36 | 128,000 | $4.38 | 133.8 | 1.02s |
| GPT-4o mini | OpenAI | OpenAI | 24 | 128,000 | $0.26 | 95.3 | 0.48s |
| GPT-4o mini | OpenAI | Microsoft Azure | 24 | 128,000 | $0.26 | 120.0 | 0.78s |
| Grok 3 | xAI | xAI | 42 | 131,000 | $6.00 | 83.1 | 0.62s |
| Grok 3 | xAI | xAI (Fast) | 42 | 131,000 | $10.00 | 88.0 | 0.62s |
| Grok 3 mini Reasoning (high) | xAI | xAI | 58 | 131,000 | $0.35 | 210.7 | 0.60s |
| Grok 3 mini Reasoning (high) | xAI | xAI (Fast) | 58 | 131,000 | $1.45 | 210.6 | 0.65s |
| Grok 4 | xAI | xAI | 68 | 256,000 | $6.00 | 68.2 | 10.03s |
| Jamba 1.5 Large | AI21 Labs | Microsoft Azure | 20 | 256,000 | $3.50 | 50.6 | 0.69s |
| Jamba 1.5 Mini | AI21 Labs | Microsoft Azure | 18 | 256,000 | $0.25 | 82.4 | 0.48s |
| Jamba 1.6 Large | AI21 Labs | AI21 Labs | 20 | 256,000 | $3.50 | 59.7 | 0.70s |
| Jamba 1.6 Mini | AI21 Labs | AI21 Labs | 18 | 256,000 | $0.25 | 165.2 | 0.60s |
| Jamba 1.7 Large | AI21 Labs | AI21 Labs | 21 | 256,000 | $3.50 | 53.3 | 0.80s |
| Jamba 1.7 Mini | AI21 Labs | AI21 Labs | 9 | 258,000 | $0.25 | 165.7 | 0.66s |
| Kimi K2 | Moonshot AI | Parasail | 48 | 128,000 | $1.07 | 45.2 | 0.53s |
| Kimi K2 | Moonshot AI | Moonshot AI | 48 | 128,000 | $1.07 | 45.2 | 0.53s |
| Kimi K2 | Moonshot AI | Fireworks AI | 48 | 128,000 | $1.50 | 45.2 | 0.53s |
| Kimi K2 | Moonshot AI | Novita | 48 | 128,000 | $1.00 | 45.2 | 0.53s |
| Kimi K2 | Moonshot AI | Groq | 48 | 128,000 | $1.50 | 45.2 | 0.53s |
| Kimi K2 | Moonshot AI | Together.ai | 48 | 128,000 | $1.50 | 45.2 | 0.53s |
| Kimi K2 | Moonshot AI | Baseten | 48 | 131,000 | $1.07 | 88.9 | 0.18s |
| LFM 40B | Liquid AI | Lambda | 22 | 32,000 | $0.15 | 161.1 | 0.16s |
| Llama 2 Chat 7B | Meta | Replicate | 8 | 4,000 | $0.10 | 132.0 | 0.42s |
| Llama 3 70B | Meta | Replicate | 18 | 8,000 | $1.18 | 43.6 | 0.42s |
| Llama 3 70B | Meta | Hyperbolic | 18 | 8,000 | $0.40 | 18.9 | 1.59s |
| Llama 3 70B | Meta | Amazon Bedrock | 18 | 8,000 | $2.86 | 47.4 | 0.41s |
| Llama 3 70B | Meta | Microsoft Azure | 18 | 8,000 | $2.90 | 18.9 | 0.76s |
| Llama 3 70B | Meta | Deepinfra | 18 | 8,000 | $0.33 | 43.9 | 0.31s |
| Llama 3 70B | Meta | Novita | 18 | 8,000 | $0.57 | 18.9 | 1.29s |
| Llama 3 70B | Meta | Groq | 18 | 8,000 | $0.64 | 485.3 | 0.20s |
| Llama 3 70B | Meta | Together.ai (Reference, FP16) | 18 | 8,000 | $0.88 | 114.4 | 0.34s |
| Llama 3 70B | Meta | Together.ai (Turbo, FP8) | 18 | 8,000 | $0.88 | 114.6 | 0.36s |
| Llama 3 8B | Meta | Replicate | 21 | 8,000 | $0.10 | 80.6 | 0.39s |
| Llama 3 8B | Meta | Amazon Bedrock | 21 | 8,000 | $0.38 | 103.9 | 0.31s |
| Llama 3 8B | Meta | Microsoft Azure | 21 | 8,000 | $0.38 | 73.7 | 0.37s |
| Llama 3 8B | Meta | Deepinfra | 21 | 8,000 | $0.04 | 125.7 | 0.50s |
| Llama 3 8B | Meta | Novita | 21 | 8,000 | $0.04 | 72.7 | 0.84s |
| Llama 3 8B | Meta | Groq | 21 | 8,000 | $0.06 | 1228.9 | 0.27s |
| Llama 3.1 405B | Meta | Lambda (FP8) | 29 | 128,000 | $0.80 | 32.6 | 0.34s |
| Llama 3.1 405B | Meta | Replicate | 29 | 128,000 | $9.50 | 19.3 | 1.00s |
| Llama 3.1 405B | Meta | Hyperbolic | 29 | 128,000 | $4.00 | 92.2 | 0.97s |
| Llama 3.1 405B | Meta | Amazon Bedrock (Standard) | 29 | 128,000 | $2.40 | 29.7 | 1.85s |
| Llama 3.1 405B | Meta | Amazon Bedrock (Latency Optimized) | 29 | 128,000 | $3.00 | 89.4 | 0.45s |
| Llama 3.1 405B | Meta | Nebius (Base) | 29 | 128,000 | $1.50 | 32.9 | 0.67s |
| Llama 3.1 405B | Meta | Google Vertex | 29 | 128,000 | $7.75 | 29.7 | 0.42s |
| Llama 3.1 405B | Meta | Microsoft Azure | 29 | 128,000 | $8.00 | 31.1 | 0.47s |
| Llama 3.1 405B | Meta | Fireworks | 29 | 128,000 | $3.00 | 99.4 | 0.45s |
| Llama 3.1 405B | Meta | Deepinfra | 29 | 33,000 | $0.80 | 27.7 | 0.71s |
| Llama 3.1 405B | Meta | SambaNova | 29 | 16,000 | $6.25 | 168.2 | 0.70s |
| Llama 3.1 405B | Meta | Databricks | 29 | 128,000 | $7.50 | 38.9 | 0.94s |
| Llama 3.1 405B | Meta | Together.ai (Turbo) | 29 | 128,000 | $3.50 | 91.9 | 0.42s |
| Llama 3.1 405B | Meta | Databricks | 29 | 128,000 | $7.50 | 38.7 | 0.97s |
| Llama 3.1 70B | Meta | Lambda (FP8) | 26 | 128,000 | $0.17 | 51.0 | 0.22s |
| Llama 3.1 70B | Meta | Hyperbolic | 26 | 128,000 | $0.40 | 130.4 | 0.89s |
| Llama 3.1 70B | Meta | Amazon Bedrock (Standard) | 26 | 128,000 | $0.72 | 31.7 | 0.65s |
| Llama 3.1 70B | Meta | Amazon Bedrock (Latency Optimized) | 26 | 128,000 | $0.90 | 141.3 | 0.31s |
| Llama 3.1 70B | Meta | Nebius (Base) | 26 | 128,000 | $0.20 | 29.9 | 0.65s |
| Llama 3.1 70B | Meta | Google Vertex | 26 | 128,000 | $0.00 | 72.9 | 0.27s |
| Llama 3.1 70B | Meta | Microsoft Azure | 26 | 128,000 | $2.90 | 64.1 | 0.43s |
| Llama 3.1 70B | Meta | Fireworks | 26 | 128,000 | $0.90 | 158.6 | 0.34s |
| Llama 3.1 70B | Meta | Deepinfra (Turbo, FP8) | 26 | 128,000 | $0.14 | 37.3 | 0.26s |
| Llama 3.1 70B | Meta | Deepinfra | 26 | 128,000 | $0.27 | 34.9 | 0.30s |
| Llama 3.1 70B | Meta | Together.ai (Turbo) | 26 | 128,000 | $0.88 | 107.5 | 0.39s |
| Llama 3.1 70B | Meta | Simplismart | 26 | 128,000 | $0.90 | 125.4 | 0.50s |
| Llama 3.1 8B | Meta | Lambda | 14 | 128,000 | $0.03 | 141.2 | 0.21s |
| Llama 3.1 8B | Meta | Cerebras | 14 | 33,000 | $0.10 | 2269.2 | 0.25s |
| Llama 3.1 8B | Meta | Hyperbolic | 14 | 128,000 | $0.10 | 414.2 | 0.70s |
| Llama 3.1 8B | Meta | Amazon Bedrock | 14 | 128,000 | $0.22 | 229.3 | 0.28s |
| Llama 3.1 8B | Meta | Nebius (Fast) | 14 | 128,000 | $0.04 | 182.5 | 0.47s |
| Llama 3.1 8B | Meta | Nebius (Base) | 14 | 128,000 | $0.03 | 67.0 | 0.53s |
| Llama 3.1 8B | Meta | Google Vertex | 14 | 128,000 | $0.00 | 119.1 | 0.18s |
| Llama 3.1 8B | Meta | Microsoft Azure | 14 | 128,000 | $0.38 | 226.2 | 0.31s |
| Llama 3.1 8B | Meta | Fireworks | 14 | 128,000 | $0.20 | 306.9 | 0.25s |
| Llama 3.1 8B | Meta | Deepinfra | 14 | 128,000 | $0.04 | 55.3 | 0.27s |
| Llama 3.1 8B | Meta | FriendliAI | 14 | 128,000 | $0.10 | 469.8 | 0.27s |
| Llama 3.1 8B | Meta | Novita | 14 | 16,000 | $0.03 | 74.1 | 0.86s |
| Llama 3.1 8B | Meta | Groq | 14 | 128,000 | $0.06 | 629.1 | 0.16s |
| Llama 3.1 8B | Meta | SambaNova | 14 | 16,000 | $0.13 | 1191.1 | 0.26s |
| Llama 3.1 8B | Meta | Together.ai (Turbo) | 14 | 128,000 | $0.18 | 159.8 | 0.25s |
| Llama 3.1 8B | Meta | Simplismart | 14 | 128,000 | $0.15 | 473.6 | 1.65s |
| Llama 3.1 8B | Meta | kluster.ai | 14 | 128,000 | $0.18 | 119.0 | 0.24s |
| Llama 3.1 Nemotron 70B | NVIDIA | Lambda (FP8) | 28 | 128,000 | $0.17 | 50.5 | 0.23s |
| Llama 3.1 Nemotron 70B | NVIDIA | Deepinfra | 28 | 128,000 | $0.17 | 39.6 | 0.29s |
| Llama 3.2 11B (Vision) | Meta | Amazon Bedrock | 16 | 128,000 | $0.16 | 189.6 | 0.46s |
| Llama 3.2 11B (Vision) | Meta | Deepinfra | 16 | 128,000 | $0.05 | 53.6 | 0.31s |
| Llama 3.2 1B | Meta | Amazon Bedrock | 10 | 128,000 | $0.10 | 129.2 | 0.46s |
| Llama 3.2 1B | Meta | Deepinfra | 10 | 128,000 | $0.01 | 285.5 | 0.27s |
| Llama 3.2 3B | Meta | Lambda (FP8) | 20 | 128,000 | $0.02 | 216.0 | 0.20s |
| Llama 3.2 3B | Meta | Hyperbolic | 20 | 128,000 | $0.10 | 56.2 | 1.16s |
| Llama 3.2 3B | Meta | Amazon Bedrock | 20 | 128,000 | $0.15 | 71.1 | 0.47s |
| Llama 3.2 3B | Meta | Deepinfra | 20 | 128,000 | $0.00 | 151.4 | 0.43s |
| Llama 3.2 3B | Meta | Novita | 20 | 32,000 | $0.04 | 67.3 | 0.73s |
| Llama 3.2 3B | Meta | Together.ai (Turbo) | 20 | 128,000 | $0.06 | 156.5 | 0.31s |
| Llama 3.2 90B (Vision) | Meta | Amazon Bedrock | 24 | 128,000 | $0.72 | 60.4 | 0.52s |
| Llama 3.2 90B (Vision) | Meta | Google Vertex | 24 | 128,000 | $0.00 | 32.5 | 0.20s |
| Llama 3.2 90B (Vision) | Meta | Deepinfra | 24 | 33,000 | $0.36 | 35.6 | 0.31s |
| Llama 3.3 70B | Meta | Lambda (FP8) | 31 | 128,000 | $0.17 | 56.5 | 0.29s |
| Llama 3.3 70B | Meta | Parasail (FP8) | 31 | 131,000 | $0.28 | 116.0 | 0.46s |
| Llama 3.3 70B | Meta | Cerebras | 33 | 128,000 | $0.94 | 2455.4 | 0.25s |
| Llama 3.3 70B | Meta | Hyperbolic | 33 | 128,000 | $0.40 | 41.3 | 1.09s |
| Llama 3.3 70B | Meta | Amazon Bedrock | 33 | 128,000 | $0.71 | 244.4 | 0.53s |
| Llama 3.3 70B | Meta | Nebius (Fast) | 31 | 128,000 | $0.38 | 191.0 | 0.55s |
| Llama 3.3 70B | Meta | Nebius (Base) | 31 | 128,000 | $0.20 | 37.5 | 0.63s |
| Llama 3.3 70B | Meta | Google Vertex | 31 | 128,000 | $0.72 | 85.7 | 0.20s |
| Llama 3.3 70B | Meta | Snowflake | 31 | 8,000 | $0.58 | 84.7 | 0.31s |
| Llama 3.3 70B | Meta | CentML | 31 | 128,000 | $0.00 | 152.8 | 0.53s |
| Llama 3.3 70B | Meta | Microsoft Azure | 31 | 128,000 | $0.71 | 55.9 | 0.45s |
| Llama 3.3 70B | Meta | Fireworks | 31 | 128,000 | $0.90 | 121.5 | 0.41s |
| Llama 3.3 70B | Meta | Deepinfra (Turbo, FP8) | 31 | 128,000 | $0.08 | 36.6 | 0.25s |
| Llama 3.3 70B | Meta | Deepinfra | 31 | 128,000 | $0.27 | 36.4 | 0.34s |
| Llama 3.3 70B | Meta | FriendliAI | 31 | 128,000 | $0.60 | 156.9 | 0.41s |
| Llama 3.3 70B | Meta | Novita | 31 | 128,000 | $0.20 | 44.9 | 0.58s |
| Llama 3.3 70B | Meta | Groq | 31 | 128,000 | $0.64 | 442.5 | 0.22s |
| Llama 3.3 70B | Meta | SambaNova | 31 | 128,000 | $0.75 | 446.4 | 0.29s |
| Llama 3.3 70B | Meta | Together.ai (Turbo) | 31 | 128,000 | $0.88 | 108.0 | 0.34s |
| Llama 3.3 70B | Meta | kluster.ai | 31 | 128,000 | $0.70 | 32.4 | 0.39s |
| Llama 3.3 70B Snowflake | Meta | Snowflake | 33 | 8,000 | $0.58 | 180.2 | 0.26s |
| Llama 4 Maverick | Meta | Lambda (FP8) | 33 | 1,000,000 | $0.28 | 171.1 | 0.23s |
| Llama 4 Maverick | Meta | Parasail (FP8) | 33 | 1,000,000 | $0.35 | 183.7 | 0.36s |
| Llama 4 Maverick | Meta | Amazon Bedrock | 33 | 128,000 | $0.42 | 317.9 | 0.59s |
| Llama 4 Maverick | Meta | Google Vertex | 33 | 524,000 | $0.55 | 125.0 | 0.36s |
| Llama 4 Maverick | Meta | CentML (FP8) | 33 | 1,000,000 | $0.00 | 131.9 | 0.32s |
| Llama 4 Maverick | Meta | Microsoft Azure (FP8) | 33 | 128,000 | $0.61 | 171.8 | 0.32s |
| Llama 4 Maverick | Meta | Fireworks (Base) | 33 | 1,000,000 | $0.39 | 175.2 | 0.44s |
| Llama 4 Maverick | Meta | Deepinfra (FP8) | 33 | 131,000 | $0.26 | 105.3 | 0.23s |
| Llama 4 Maverick | Meta | Deepinfra (Turbo, FP8) | 33 | 8,000 | $0.50 | 979.1 | 0.18s |
| Llama 4 Maverick | Meta | Novita (FP8) | 33 | 1,000,000 | $0.34 | 108.2 | 0.51s |
| Llama 4 Maverick | Meta | GMI (FP8) | 33 | 1,000,000 | $0.39 | 159.2 | 0.46s |
| Llama 4 Maverick | Meta | Groq | 33 | 128,000 | $0.30 | 559.8 | 0.11s |
| Llama 4 Maverick | Meta | SambaNova | 33 | 131,000 | $0.92 | 801.8 | 0.36s |
| Llama 4 Maverick | Meta | Together.ai | 33 | 524,000 | $0.41 | 120.1 | 0.35s |
| Llama 4 Maverick | Meta | kluster.ai (FP8) | 33 | 1,000,000 | $0.35 | 168.9 | 0.40s |
| Llama 4 Maverick | Meta | Cerebras | 51 | 32,000 | $0.30 | 2104.2 | 0.22s |
| Llama 4 Maverick | Meta | Microsoft Azure | 51 | 128,000 | $0.61 | 191.1 | 0.32s |
| Llama 4 Maverick | Meta | Novita | 51 | 1,000,000 | $0.34 | 133.5 | 0.44s |
| Llama 4 Maverick | Meta | GMI | 51 | 1,000,000 | $0.39 | 188.3 | 0.35s |
| Llama 4 Scout | Meta | Lambda | 34 | 1,000,000 | $0.14 | 118.0 | 0.24s |
| Llama 4 Scout | Meta | Parasail (FP8) | 34 | 158,000 | $0.19 | 114.7 | 0.36s |
| Llama 4 Scout | Meta | Cerebras | 34 | 32,000 | $0.70 | 2808.5 | 0.26s |
| Llama 4 Scout | Meta | Amazon Bedrock | 34 | 128,000 | $0.29 | 169.0 | 0.51s |
| Llama 4 Scout | Meta | Google Vertex | 34 | 1,000,000 | $0.36 | 134.6 | 0.37s |
| Llama 4 Scout | Meta | CentML | 34 | 1,000,000 | $0.00 | 119.1 | 0.34s |
| Llama 4 Scout | Meta | Microsoft Azure | 34 | 128,000 | $0.34 | 113.2 | 0.34s |
| Llama 4 Scout | Meta | Fireworks (Base) | 34 | 1,000,000 | $0.26 | 167.9 | 0.48s |
| Llama 4 Scout | Meta | Deepinfra | 34 | 131,000 | $0.14 | 36.4 | 0.33s |
| Llama 4 Scout | Meta | Novita | 34 | 131,000 | $0.20 | 115.1 | 0.49s |
| Llama 4 Scout | Meta | GMI | 34 | 1,000,000 | $0.18 | 130.3 | 0.43s |
| Llama 4 Scout | Meta | Groq | 34 | 131,000 | $0.17 | 599.5 | 0.19s |
| Llama 4 Scout | Meta | Together.ai | 34 | 328,000 | $0.28 | 123.8 | 0.20s |
| Llama 4 Scout | Meta | kluster.ai | 34 | 128,000 | $0.71 | 90.8 | 0.45s |
| Llama Nemotron Ultra Reasoning | NVIDIA | Nebius (Base) | 46 | 131,000 | $0.90 | 41.8 | 0.64s |
Magistral Medium
Magistral Medium
MistralMistral
38
41,000
$2.75
124.5
0.43s
Magistral Small
Magistral Small
MistralMistral
36
40,000
$0.75
196.6
0.31s
MiniMax M1 40k
MiniMax M1 40k
MiniMaxMiniMax
51
1,000,000
$0.82
19.4
1.31s
MiniMax M1 80k
MiniMax M1 80k
MiniMaxMiniMax
53
1,000,000
$0.82
20.2
1.46s
MiniMax-Text-01
MiniMax-Text-01
MiniMaxMiniMax
29
1,000,000
$0.42
30.4
1.15s
Ministral 3B
Ministral 3B
MistralMistral
10
128,000
$0.04
270.1
0.29s
Ministral 8B
Ministral 8B
MistralMistral
13
128,000
$0.10
197.3
0.33s
Mistral 7B
Mistral 7B
MistralMistral
10
8,000
$0.25
125.1
0.30s
Mistral 7B
Mistral 7B
MistralAmazon Bedrock
10
8,000
$0.16
93.8
0.32s
Mistral 7B
Mistral 7B
MistralDeepinfra
10
8,000
$0.04
102.6
0.20s
Mistral 7B
Mistral 7B
MistralNovita
10
32,000
$0.04
117.1
0.86s
Mistral 7B
Mistral 7B
MistralTogether.ai
10
8,000
$0.20
157.7
0.20s
Mistral Large (Feb '24)
Mistral Large (Feb '24)
MistralAmazon Bedrock
26
33,000
$6.00
45.1
0.40s
Mistral Large 2 (Jul '24)
Mistral Large 2 (Jul '24)
MistralMistral
28
128,000
$3.00
95.6
0.42s
Mistral Large 2 (Jul '24)
Mistral Large 2 (Jul '24)
MistralAmazon Bedrock
28
128,000
$3.00
33.6
0.45s
Mistral Large 2 (Nov '24)
Mistral Large 2 (Nov '24)
MistralMistral
38
128,000
$3.00
32.9
0.51s
Mistral Large 2 (Nov '24)
Mistral Large 2 (Nov '24)
MistralMicrosoft Azure
38
128,000
$3.00
23.8
0.49s
Mistral Medium
Mistral Medium
MistralMistral
23
33,000
$4.09
53.0
0.45s
Mistral Medium 3
Mistral Medium 3
MistralMistral
39
128,000
$0.80
57.7
0.42s
Mistral Medium 3
Mistral Medium 3
MistralMicrosoft Azure
39
128,000
$0.80
55.0
0.42s
Mistral NeMo
Mistral NeMo
MistralMistral
20
128,000
$0.15
176.6
0.30s
Mistral NeMo (FP8)
Mistral NeMo
MistralParasail (FP8)
20
131,000
$0.11
151.3
0.36s
Mistral NeMo Base
Mistral NeMo
MistralNebius (Base)
20
128,000
$0.06
51.5
0.54s
Mistral NeMo
Mistral NeMo
MistralDeepinfra
20
128,000
$0.01
50.5
0.23s
Mistral Saba
Mistral Saba
MistralMistral
23
32,000
$0.30
95.3
0.31s
Mistral Small (Feb '24)
Mistral Small (Feb '24)
MistralMistral
23
33,000
$1.50
208.7
0.30s
Mistral Small (Feb '24)
Mistral Small (Feb '24)
MistralMicrosoft Azure
23
33,000
$1.50
87.9
0.41s
Mistral Small (Sep '24)
Mistral Small (Sep '24)
MistralMistral
18
33,000
$0.30
124.7
0.31s
Mistral Small 3
Mistral Small 3
MistralMistral
26
32,000
$0.15
110.1
0.33s
Mistral Small 3
Mistral Small 3
MistralDeepinfra
26
32,000
$0.06
85.1
0.20s
Mistral Small 3
Mistral Small 3
MistralTogether.ai
26
32,000
$0.80
95.9
0.18s
Mistral Small 3.1
Mistral Small 3.1
MistralMistral
26
128,000
$0.15
179.1
0.28s
Mistral Small 3.1
Mistral Small 3.1
MistralParasail
26
128,000
$0.15
61.5
0.41s
Mistral Small 3.1 Vertex
Mistral Small 3.1
MistralGoogle Vertex
26
128,000
$0.15
34.4
0.22s
Mistral Small 3.2
Mistral Small 3.2
MistralMistral
32
33,000
$0.15
204.1
0.29s
Mistral Small 3.2 (FP8)
Mistral Small 3.2
MistralDeepinfra (FP8)
32
128,000
$0.06
39.1
0.38s
Mixtral 8x22B
Mixtral 8x22B
MistralMistral
17
65,000
$3.00
55.5
0.37s
Mixtral 8x22B
Mixtral 8x22B
MistralFireworks
17
65,000
$1.20
75.3
0.34s
Mixtral 8x7B
Mixtral 8x7B
MistralMistral
17
33,000
$0.70
78.6
0.36s
Mixtral 8x7B
Mixtral 8x7B
Mistral
Amazon Bedrock
17
33,000
$0.51
93.3
0.33s
Mixtral 8x7B
Mixtral 8x7B
Mistral
Deepinfra
17
33,000
$0.12
85.5
0.25s
Mixtral 8x7B
Mixtral 8x7B
Mistral
Together.ai
17
33,000
$0.60
46.6
0.94s
Nova Lite
Nova Lite
Amazon
Amazon Bedrock
33
300,000
$0.10
229.2
0.46s
Nova Micro
Nova Micro
Amazon
Amazon Bedrock
28
130,000
$0.06
371.5
0.43s
Nova Premier
Nova Premier
Amazon
Amazon Bedrock
35
1,000,000
$5.00
85.5
0.97s
Nova Pro
Nova Pro
Amazon
Amazon Bedrock
37
300,000
$1.40
115.8
0.53s
o1
o1
OpenAI
OpenAI
53
200,000
$26.25
192.0
14.52s
o1
o1
OpenAI
Microsoft Azure
53
200,000
$26.25
114.0
26.41s
o1-mini
o1-mini
OpenAI
OpenAI
45
128,000
$1.93
244.2
9.05s
o1-mini
o1-mini
OpenAI
Microsoft Azure
45
128,000
$1.93
270.4
8.87s
o1-preview
o1-preview
OpenAI
OpenAI
49
128,000
$26.25
165.2
19.20s
o1-preview
o1-preview
OpenAI
Microsoft Azure
49
128,000
$28.88
155.0
19.64s
o3
o3
OpenAI
OpenAI
67
200,000
$3.50
200.4
13.03s
o3
o3
OpenAI
Microsoft Azure
67
200,000
$3.50
99.1
31.89s
o3-mini
o3-mini
OpenAI
OpenAI
53
200,000
$1.93
189.1
11.71s
o3-mini
o3-mini
OpenAI
Microsoft Azure
53
200,000
$1.93
218.3
11.66s
o3-mini (high)
o3-mini (high)
OpenAI
OpenAI
57
200,000
$1.93
194.2
35.49s
o3-mini (high)
o3-mini (high)
OpenAI
Microsoft Azure
57
200,000
$1.93
213.2
32.91s
o3-pro
o3-pro
OpenAI
OpenAI
68
200,000
$35.00
32.6
85.76s
o3-pro
o3-pro
OpenAI
OpenAI
68
200,000
$35.00
21.9
115.76s
o4-mini (high)
o4-mini (high)
OpenAI
OpenAI
65
200,000
$1.93
113.9
47.82s
o4-mini (high)
o4-mini (high)
OpenAI
Microsoft Azure
65
200,000
$1.93
152.2
30.12s
Phi-3 Medium 14B
Phi-3 Medium 14B
Microsoft Azure
Microsoft Azure
25
128,000
$0.30
52.9
0.43s
Phi-4
Phi-4
Microsoft Azure
Nebius
28
16,000
$0.15
106.2
0.48s
Phi-4
Phi-4
Microsoft Azure
Microsoft Azure
28
16,000
$0.22
22.2
0.47s
Phi-4
Phi-4
Microsoft Azure
Deepinfra
28
16,000
$0.09
39.1
0.26s
Phi-4 Mini
Phi-4 Mini
Microsoft Azure
Microsoft Azure
26
128,000
$0.00
31.7
0.38s
Phi-4 Multimodal
Phi-4 Multimodal
Microsoft Azure
Microsoft Azure
18
128,000
$0.00
22.0
0.33s
Pixtral 12B
Pixtral 12B
Mistral
Mistral
14
128,000
$0.15
103.3
0.31s
Pixtral 12B
Pixtral 12B
Mistral
Hyperbolic
14
128,000
$0.10
104.7
0.44s
Pixtral Large
Pixtral Large
Mistral
Mistral
37
128,000
$3.00
95.4
0.41s
Qwen2 72B
Qwen2 72B
Alibaba
Together.ai
33
33,000
$0.90
39.0
0.43s
Qwen2 72B
Qwen2 72B
Alibaba
Alibaba Cloud
33
131,000
$0.00
31.0
1.33s
Qwen2.5 72B
Qwen2.5 72B
Alibaba
Hyperbolic
29
131,000
$0.40
31.9
1.29s
Qwen2.5 72B
Qwen2.5 72B
Alibaba
Nebius
29
131,000
$0.20
35.5
0.63s
Qwen2.5 72B Fast
Qwen2.5 72B
Alibaba
Nebius (Fast)
29
131,000
$0.38
65.2
0.52s
Qwen2.5 72B
Qwen2.5 72B
Alibaba
Fireworks
29
131,000
$0.90
75.2
0.32s
Qwen2.5 72B
Qwen2.5 72B
Alibaba
Deepinfra
29
33,000
$0.19
42.0
0.54s
Qwen2.5 72B Turbo
Qwen2.5 72B
Alibaba
Together.ai (Turbo)
29
131,000
$1.20
114.8
0.28s
Qwen2.5 72B
Qwen2.5 72B
Alibaba
Alibaba Cloud
29
131,000
$0.00
58.2
1.28s
Qwen2.5 Coder 32B
Qwen2.5 Coder 32B
Alibaba
Lambda
27
33,000
$0.09
42.9
0.32s
Qwen2.5 Coder 32B
Qwen2.5 Coder 32B
Alibaba
Hyperbolic
27
131,000
$0.20
51.8
1.13s
Qwen2.5 Coder 32B
Qwen2.5 Coder 32B
Alibaba
Deepinfra
27
33,000
$0.08
50.2
0.24s
Qwen2.5 Coder 32B
Qwen2.5 Coder 32B
Alibaba
Together.ai
27
131,000
$0.80
90.3
0.21s
Qwen2.5 Instruct 32B Fast
Qwen2.5 Instruct 32B
Alibaba
Nebius (Fast)
37
128,000
$0.20
88.5
0.52s
Qwen2.5 Instruct 32B Base
Qwen2.5 Instruct 32B
Alibaba
Nebius (Base)
37
128,000
$0.10
58.7
0.55s
Qwen2.5 Max
Qwen2.5 Max
Alibaba
Alibaba Cloud
34
32,000
$2.80
40.2
1.41s
Qwen2.5 Turbo
Qwen2.5 Turbo
Alibaba
Alibaba Cloud
34
1,000,000
$0.09
48.9
1.07s
Qwen3 0.6B
Qwen3 0.6B
Alibaba
Alibaba Cloud
17
33,000
$0.19
232.8
0.91s
Qwen3 0.6B (Reasoning)
Qwen3 0.6B (Reasoning)
Alibaba
Alibaba Cloud
23
33,000
$0.40
229.3
0.94s
Qwen3 1.7B
Qwen3 1.7B
Alibaba
Alibaba Cloud
25
33,000
$0.19
141.2
0.92s
Qwen3 1.7B (Reasoning)
Qwen3 1.7B (Reasoning)
Alibaba
Alibaba Cloud
38
33,000
$0.40
138.8
0.94s
Qwen3 14B
Qwen3 14B
Alibaba
Alibaba Cloud
32
131,000
$0.61
66.5
1.17s
Qwen3 14B (Reasoning) Base
Qwen3 14B (Reasoning)
Alibaba
Nebius (Base)
46
33,000
$0.12
85.8
0.52s
Qwen3 14B (Reasoning) (FP8)
Qwen3 14B (Reasoning)
Alibaba
Deepinfra (FP8)
46
128,000
$0.12
75.0
0.51s
Qwen3 14B (Reasoning)
Qwen3 14B (Reasoning)
Alibaba
Alibaba Cloud
56
131,000
$1.31
65.6
1.06s
Qwen3 235B
Qwen3 235B
Alibaba
GMI
47
41,000
$0.40
64.4
0.62s
Qwen3 235B
Qwen3 235B
Alibaba
Alibaba Cloud
47
131,000
$1.23
42.4
1.28s
Qwen3 235B (Reasoning) (FP8)
Qwen3 235B (Reasoning)
Alibaba
Parasail (FP8)
56
41,000
$0.35
57.3
0.45s
Qwen3 235B (Reasoning) Base
Qwen3 235B (Reasoning)
Alibaba
Nebius (Base)
56
33,000
$0.30
50.2
0.56s
Qwen3 235B (Reasoning)
Qwen3 235B (Reasoning)
Alibaba
Fireworks
56
128,000
$0.10
79.0
0.84s
Qwen3 235B (Reasoning) (FP8)
Qwen3 235B (Reasoning)
Alibaba
Deepinfra (FP8)
56
41,000
$0.30
15.2
0.44s
Qwen3 235B (Reasoning) (FP8)
Qwen3 235B (Reasoning)
Alibaba
Novita (FP8)
56
128,000
$0.35
17.3
0.66s
Qwen3 235B (Reasoning)
Qwen3 235B (Reasoning)
Alibaba
GMI
56
41,000
$0.40
60.7
0.60s
Qwen3 235B (Reasoning) (FP8)
Qwen3 235B (Reasoning)
Alibaba
Together.ai (FP8)
56
41,000
$0.30
38.6
0.37s
Qwen3 235B (Reasoning) (FP8)
Qwen3 235B (Reasoning)
Alibaba
kluster.ai (FP8)
56
41,000
$0.61
41.5
0.49s
Qwen3 235B (Reasoning)
Qwen3 235B (Reasoning)
Alibaba
Alibaba Cloud
56
131,000
$2.63
42.4
1.14s
Qwen3 235B 2507 (Non-reasoning)
Qwen3 235B 2507 (Non-reasoning)
Alibaba
Parasail
48
262,000
$0.33
53.7
0.41s
Qwen3 235B 2507 (Non-reasoning)
Qwen3 235B 2507 (Non-reasoning)
Alibaba
Cerebras
48
131,000
$0.75
1467.0
0.23s
Qwen3 235B 2507 (Non-reasoning)
Qwen3 235B 2507 (Non-reasoning)
Alibaba
Fireworks
48
128,000
$0.39
131.2
0.55s
Qwen3 235B 2507 (Non-reasoning)
Qwen3 235B 2507 (Non-reasoning)
Alibaba
Deepinfra
48
262,000
$0.25
35.4
0.50s
Qwen3 235B 2507 (Non-reasoning)
Qwen3 235B 2507 (Non-reasoning)
Alibaba
Novita
48
262,000
$0.31
83.2
0.81s
Qwen3 235B 2507 (Non-reasoning)
Qwen3 235B 2507 (Non-reasoning)
Alibaba
Together.ai
48
256,000
$0.30
37.0
0.31s
Qwen3 235B 2507 (Reasoning)
Qwen3 235B 2507 (Reasoning)
Alibaba
Parasail
64
132,000
$1.24
71.3
0.47s
Qwen3 235B 2507 (Reasoning)
Qwen3 235B 2507 (Reasoning)
Alibaba
Cerebras
64
131,000
$0.75
1623.9
0.25s
Qwen3 235B 2507 (Reasoning)
Qwen3 235B 2507 (Reasoning)
Alibaba
Deepinfra
64
262,000
$0.25
30.8
0.33s
Qwen3 235B 2507 (Reasoning)
Qwen3 235B 2507 (Reasoning)
Alibaba
Novita
64
132,000
$0.97
52.2
1.00s
Qwen3 235B 2507 (Reasoning)
Qwen3 235B 2507 (Reasoning)
Alibaba
Together.ai
64
262,000
$1.24
45.4
0.34s
Qwen3 30B 2507 (Non-reasoning)
Qwen3 30B 2507 (Non-reasoning)
Alibaba
Alibaba Cloud
57
128,000
$0.35
106.2
1.17s
Qwen3 30B 2507 (Reasoning)
Qwen3 30B 2507 (Reasoning)
Alibaba
Alibaba Cloud
55
128,000
$0.75
110.4
1.18s
Qwen3 30B A3B
Qwen3 30B A3B
Alibaba
Alibaba Cloud
34
131,000
$0.35
49.2
1.12s
Qwen3 30B A3B (Reasoning) (FP8)
Qwen3 30B A3B (Reasoning)
Alibaba
Parasail (FP8)
42
41,000
$0.20
158.7
0.39s
Qwen3 30B A3B (Reasoning) Fast
Qwen3 30B A3B (Reasoning)
Alibaba
Nebius (Fast)
42
33,000
$0.45
139.2
0.51s
Qwen3 30B A3B (Reasoning) Base
Qwen3 30B A3B (Reasoning)
Alibaba
Nebius (Base)
42
33,000
$0.15
124.9
0.49s
Qwen3 30B A3B (Reasoning)
Qwen3 30B A3B (Reasoning)
Alibaba
Fireworks
42
131,000
$0.90
167.9
0.38s
Qwen3 30B A3B (Reasoning) (FP8)
Qwen3 30B A3B (Reasoning)
Alibaba
Deepinfra (FP8)
42
41,000
$0.15
112.1
0.19s
Qwen3 30B A3B (Reasoning) (FP8)
Qwen3 30B A3B (Reasoning)
Alibaba
Novita (FP8)
42
128,000
$0.19
162.4
0.71s
Qwen3 30B A3B (Reasoning)
Qwen3 30B A3B (Reasoning)
Alibaba
Alibaba Cloud
42
131,000
$0.75
48.5
1.21s
Qwen3 32B (FP8)
Qwen3 32B
Alibaba
Parasail (FP8)
30
41,000
$0.20
53.9
0.45s
Qwen3 32B
Qwen3 32B
Alibaba
Cerebras
30
128,000
$0.50
2359.6
0.24s
Qwen3 32B Base
Qwen3 32B
Alibaba
Nebius (Base)
30
33,000
$0.15
46.5
0.57s
Qwen3 32B Fast
Qwen3 32B
Alibaba
Nebius (Fast)
30
33,000
$0.30
199.0
0.52s
Qwen3 32B (FP8)
Qwen3 32B
Alibaba
Novita (FP8)
30
41,000
$0.19
45.5
1.08s
Qwen3 32B (FP8)
Qwen3 32B
Alibaba
GMI (FP8)
30
33,000
$0.23
53.8
0.81s
Qwen3 32B
Qwen3 32B
Alibaba
Groq
30
131,000
$0.36
616.0
0.14s
Qwen3 32B
Qwen3 32B
Alibaba
SambaNova
30
33,000
$0.50
344.8
0.37s
Qwen3 32B
Qwen3 32B
Alibaba
Alibaba Cloud
30
131,000
$1.23
62.2
1.10s
Qwen3 32B (Reasoning) (FP8)
Qwen3 32B (Reasoning)
Alibaba
Parasail (FP8)
55
41,000
$0.20
53.5
0.48s
Qwen3 32B (Reasoning)
Qwen3 32B (Reasoning)
Alibaba
Cerebras
55
41,000
$0.50
2496.1
0.24s
Qwen3 32B (Reasoning) Base
Qwen3 32B (Reasoning)
Alibaba
Nebius (Base)
55
33,000
$0.15
46.1
0.57s
Qwen3 32B (Reasoning) (FP8)
Qwen3 32B (Reasoning)
Alibaba
Deepinfra (FP8)
55
41,000
$0.15
42.0
0.57s
Qwen3 32B (Reasoning) (FP8)
Qwen3 32B (Reasoning)
Alibaba
Novita (FP8)
55
128,000
$0.19
44.0
1.04s
Qwen3 32B (Reasoning) (FP8)
Qwen3 32B (Reasoning)
Alibaba
GMI (FP8)
55
33,000
$0.23
53.4
0.71s
Qwen3 32B (Reasoning)
Qwen3 32B (Reasoning)
Alibaba
Groq
55
131,000
$0.36
627.3
0.14s
Qwen3 32B (Reasoning)
Qwen3 32B (Reasoning)
Alibaba
SambaNova
55
33,000
$0.50
265.6
0.43s
Qwen3 32B (Reasoning)
Qwen3 32B (Reasoning)
Alibaba
Alibaba Cloud
55
131,000
$2.63
60.6
1.06s
Qwen3 4B
Qwen3 4B
Alibaba
Alibaba Cloud
26
131,000
$0.19
106.4
0.99s
Qwen3 4B (Reasoning) Fast
Qwen3 4B (Reasoning)
Alibaba
Nebius (Fast)
36
33,000
$0.12
157.6
0.47s
Qwen3 4B (Reasoning) (FP8)
Qwen3 4B (Reasoning)
Alibaba
Novita (FP8)
36
128,000
$0.00
71.4
0.72s
Qwen3 4B (Reasoning)
Qwen3 4B (Reasoning)
Alibaba
Alibaba Cloud
36
131,000
$0.40
104.8
1.09s
Qwen3 8B
Qwen3 8B
Alibaba
Alibaba Cloud
37
131,000
$0.31
100.2
0.98s
Qwen3 8B (Reasoning) (FP8)
Qwen3 8B (Reasoning)
Alibaba
Novita (FP8)
51
128,000
$0.06
55.2
0.80s
Qwen3 8B (Reasoning)
Qwen3 8B (Reasoning)
Alibaba
Alibaba Cloud
51
131,000
$0.66
98.8
0.99s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Parasail
46
262,000
$1.63
49.8
0.42s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Cerebras
46
131,000
$2.00
1650.1
0.26s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Hyperbolic
46
262,000
$2.00
19.8
1.99s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Fireworks
46
262,000
$0.79
137.4
0.48s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Deepinfra
46
262,000
$0.70
51.6
0.34s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Deepinfra
46
262,000
$0.53
50.2
0.23s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Novita
46
262,000
$1.96
45.1
0.60s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
GMI
46
33,000
$1.25
83.2
0.36s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Together.ai
46
262,000
$2.00
50.8
0.47s
Qwen3 Coder 480B
Qwen3 Coder 480B
Alibaba
Alibaba Cloud
46
262,000
$3.00
52.2
1.85s
QwQ 32B-Preview
QwQ 32B-Preview
Alibaba
Deepinfra
34
33,000
$0.14
32.7
0.35s
QwQ 32B-Preview
QwQ 32B-Preview
Alibaba
Together.ai
34
33,000
$1.20
63.1
0.70s
QwQ-32B
QwQ-32B
Alibaba
Hyperbolic
50
131,000
$0.20
142.8
1.07s
QwQ-32B Fast
QwQ-32B
Alibaba
Nebius (Fast)
50
131,000
$0.75
85.7
0.51s
QwQ-32B Base
QwQ-32B
Alibaba
Nebius (Base)
50
131,000
$0.23
52.3
0.55s
QwQ-32B
QwQ-32B
Alibaba
Fireworks
50
131,000
$0.90
179.5
0.45s
QwQ-32B
QwQ-32B
Alibaba
Deepinfra
50
131,000
$0.09
47.9
0.26s
QwQ-32B
QwQ-32B
Alibaba
Groq
50
131,000
$0.32
411.6
0.23s
QwQ-32B
QwQ-32B
Alibaba
Together.ai
50
131,000
$1.20
62.9
0.52s
QwQ-32B
QwQ-32B
Alibaba
GMI
50
41,000
$0.75
52.3
0.39s
Reka Flash 3
Reka Flash 3
Reka AI
Reka AI
38
128,000
$0.35
55.9
1.25s
Solar Pro 2
Solar Pro 2
Upstage
Upstage
37
64,000
$0.00
112.8
1.82s
Solar Pro 2 (Reasoning)
Solar Pro 2 (Reasoning)
Upstage
Upstage
49
64,000
$0.00
104.7
1.67s
Sonar
Sonar
Perplexity
Perplexity
34
127,000
$1.00
172.4
1.83s
Sonar Pro
Sonar Pro
Perplexity
Perplexity
34
200,000
$6.00
154.3
1.89s

About This Tool

This interactive tool helps you compare LLM models and providers on quality index, context window, price per million tokens, output speed (tokens/s), and latency.
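
As a rough illustration of this kind of comparison, the sketch below picks the cheapest, fastest, and lowest-latency provider for a single model. The rows are the Qwen3 32B (non-reasoning) entries from the table above; the `Offer` class and its field names are hypothetical and not part of the tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Offer:
    """One model/provider row (hypothetical structure, for illustration only)."""
    model: str
    provider: str
    quality: int      # Quality Index
    context: int      # context window, in tokens
    price: float      # USD per 1M tokens
    speed: float      # output tokens per second
    latency: float    # seconds


# Values copied from the Qwen3 32B rows in the table above.
OFFERS = [
    Offer("Qwen3 32B", "Cerebras", 30, 128_000, 0.50, 2359.6, 0.24),
    Offer("Qwen3 32B", "Groq", 30, 131_000, 0.36, 616.0, 0.14),
    Offer("Qwen3 32B", "SambaNova", 30, 33_000, 0.50, 344.8, 0.37),
    Offer("Qwen3 32B", "Alibaba Cloud", 30, 131_000, 1.23, 62.2, 1.10),
]

cheapest = min(OFFERS, key=lambda o: o.price)
fastest = max(OFFERS, key=lambda o: o.speed)
lowest_latency = min(OFFERS, key=lambda o: o.latency)

print("Cheapest:", cheapest.provider)
print("Fastest:", fastest.provider)
print("Lowest latency:", lowest_latency.provider)
```

The quality index is identical across these rows because it describes the model, not the provider, so the choice between providers comes down to price, speed, latency, and context window.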

Data is sourced from artificialanalysis.ai and updated regularly.

Use the filters and chart configuration options to narrow the list and find the model and provider that best fit your needs.
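
The quality and price filters can be approximated in a few lines. This is a minimal sketch, assuming each row is a plain dictionary and that both bounds are inclusive; the function name `filter_models` and the key names are made up for illustration, and the rows are copied from the table above.

```python
# A handful of rows from the table above (price is USD per 1M tokens).
ROWS = [
    {"model": "o3", "provider": "OpenAI", "quality": 67, "price": 3.50},
    {"model": "o3-pro", "provider": "OpenAI", "quality": 68, "price": 35.00},
    {"model": "o4-mini (high)", "provider": "OpenAI", "quality": 65, "price": 1.93},
    {"model": "Qwen3 235B 2507 (Reasoning)", "provider": "Deepinfra", "quality": 64, "price": 0.25},
    {"model": "Mistral Medium 3", "provider": "Mistral", "quality": 39, "price": 0.80},
    {"model": "Nova Pro", "provider": "Amazon Bedrock", "quality": 37, "price": 1.40},
]


def filter_models(rows, min_quality=0, max_price=float("inf")):
    """Keep rows with quality >= min_quality and price <= max_price, cheapest first."""
    kept = [r for r in rows if r["quality"] >= min_quality and r["price"] <= max_price]
    return sorted(kept, key=lambda r: r["price"])


shortlist = filter_models(ROWS, min_quality=60, max_price=5.00)
for row in shortlist:
    print(f'{row["model"]} ({row["provider"]}): '
          f'quality {row["quality"]}, ${row["price"]:.2f} per 1M tokens')
```

Sorting the shortlist by price is one possible choice; sorting by speed or latency would work the same way with a different key.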

Frequently Asked Questions