Leaderboard
Sort and filter 100+ AI models by quality, price, speed, and context window — all from a single live view.
Model | Creator | Quality | Best Price | Top Speed | Max Context | Providers | Actions | |
|---|---|---|---|---|---|---|---|---|
Qwen3.7 Max OSS | Alibaba | 56.584 | $3.75/M | 197 tok/s | 991K | 1 | Compare | |
Kimi K2.6 OSS14 providers | Kimi | 53.905 | $1.44/M | 327 tok/s | 262K | 14 | Compare | |
MiMo-V2.5-Pro OSS4 providers | Xiaomi | 53.829 | $1.20/M | 83 tok/s | 1050K | 4 | Compare | |
Qwen3.6 Max Preview OSS | Alibaba | 51.814 | $2.92/M | 34 tok/s | 256K | 1 | Compare | |
DeepSeek V4 Pro OSS9 providers | DeepSeek | 51.509 | $0.54/M | 158 tok/s | 1049K | 9 | Compare | |
GLM-5.1 (Reasoning) OSS10 providers | Z AI | 51.408 | $1.66/M | 157 tok/s | 205K | 10 | Compare | |
Qwen3.6 Plus OSS | Alibaba | 49.985 | $1.13/M | 53 tok/s | 1000K | 1 | Compare | |
GLM-5 (Reasoning) OSS8 providers | Z AI | 49.77 | $1.24/M | 171 tok/s | 203K | 8 | Compare | |
MiniMax-M2.7 OSS6 providers | MiniMax | 49.615 | $0.52/M | 453 tok/s | 205K | 6 | Compare | |
MiMo-V2-Pro OSS | Xiaomi | 49.202 | $1.50/M | 61 tok/s | 131K | 1 | Compare | |
MiMo-V2.5 OSS2 providers | Xiaomi | 49.034 | $0.64/M | 95 tok/s | 1050K | 2 | Compare | |
Kimi K2.5 (Reasoning) OSS13 providers | Kimi | 46.813 | $0.90/M | 382 tok/s | 262K | 13 | Compare | |
DeepSeek V4 Flash OSS6 providers | DeepSeek | 46.517 | $0.01/M | 138 tok/s | 1049K | 6 | Compare | |
Qwen3.6 27B (Reasoning) OSS4 providers | Alibaba | 45.823 | $1.02/M | 64 tok/s | 262K | 4 | Compare | |
Qwen3.5 397B A17B (Reasoning) OSS12 providers | Alibaba | 45.047 | $0.88/M | 297 tok/s | 262K | 12 | Compare | |
MiMo-V2-Omni-0327 OSS | Xiaomi | 44.932 | $0.80/M | 92 tok/s | 131K | 1 | Compare | |
GLM-5.1 OSS7 providers | Z AI | 43.816 | $2.15/M | 167 tok/s | 205K | 7 | Compare | |
Qwen3.6 35B A3B (Reasoning) OSS6 providers | Alibaba | 43.494 | $0.40/M | 368 tok/s | 262K | 6 | Compare | |
MiMo-V2-Omni OSS | Xiaomi | 43.4 | $0.00/M | 96 tok/s | 256K | 1 | Compare | |
GLM-4.7 (Reasoning) OSS8 providers | Z AI | 42.108 | $0.74/M | 1104 tok/s | 205K | 8 | Compare | |
Qwen3.5 27B (Reasoning) OSS5 providers | Alibaba | 42.069 | $0.69/M | 85 tok/s | 262K | 5 | Compare | |
MiniMax-M2.5 OSS14 providers | MiniMax | 41.933 | $0.40/M | 302 tok/s | 205K | 14 | Compare | |
Hy3-preview (Reasoning) OSS2 providers | Tencent | 41.853 | $0.11/M | 98 tok/s | 262K | 2 | Compare | |
DeepSeek V3.2 (Reasoning) OSS8 providers | DeepSeek | 41.708 | $0.30/M | 236 tok/s | 164K | 8 | Compare | |
Qwen3.5 122B A10B (Reasoning) OSS5 providers | Alibaba | 41.602 | $0.72/M | 146 tok/s | 262K | 5 | Compare | |
MiMo-V2-Flash OSS1 provider | Xiaomi | 41.459 | $0.15/M | 132 tok/s | 256K | 1 | Compare | |
Kimi K2 Thinking OSS6 providers | Kimi | 40.893 | $1.07/M | 131 tok/s | 262K | 6 | Compare | |
GLM-5 OSS4 providers | Z AI | 40.571 | $0.97/M | 102 tok/s | 203K | 4 | Compare | |
Qwen3.5 397B A17B OSS8 providers | Alibaba | 40.099 | $1.25/M | 282 tok/s | 262K | 8 | Compare | |
Qwen3 Max Thinking OSS1 provider | Alibaba | 39.849 | $2.40/M | 49 tok/s | 262K | 1 | Compare | |
MiniMax-M2.1 OSS2 providers | MiniMax | 39.42 | $0.52/M | 98 tok/s | 205K | 2 | Compare | |
MiMo-V2-Flash (Reasoning) OSS | Xiaomi | 39.242 | $0.15/M | 127 tok/s | 256K | 1 | Compare | |
Mistral Medium 3.5 OSS | Mistral | 39.226 | $3.00/M | 149 tok/s | 262K | 1 | Compare | |
Gemma 4 31B (Reasoning) OSS9 providers | 39.183 | $0.00/M | 114 tok/s | 1049K | 9 | Compare | ||
Qwen3.5 Omni Plus OSS | Alibaba | 38.633 | $1.50/M | 55 tok/s | 256K | 1 | Compare | |
Ring-2.6-1T OSS | InclusionAI | 38.456 | $0.85/M | 123 tok/s | 262K | 1 | Compare | |
Step 3.5 Flash OSS2 providers | StepFun | 37.797 | $0.15/M | 152 tok/s | 262K | 2 | Compare | |
Kimi K2.5 OSS8 providers | Kimi | 37.274 | $0.90/M | 359 tok/s | 262K | 8 | Compare | |
Qwen3.5 27B OSS2 providers | Alibaba | 37.179 | $0.82/M | 90 tok/s | 262K | 2 | Compare | |
Command A+ OSS | Cohere | 37.157 | $0.00/M | 201 tok/s | 200K | 1 | Compare | |
Qwen3.6 27B OSS3 providers | Alibaba | 37.144 | $1.04/M | 66 tok/s | 262K | 3 | Compare | |
Qwen3.5 35B A3B (Reasoning) OSS5 providers | Alibaba | 37.122 | $0.63/M | 151 tok/s | 262K | 5 | Compare | |
MiniMax-M2 OSS4 providers | MiniMax | 36.087 | $0.52/M | 156 tok/s | 205K | 4 | Compare | |
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) OSS3 providers | NVIDIA | 35.97 | $0.35/M | 506 tok/s | 262K | 3 | Compare | |
Qwen3.5 122B A10B OSS2 providers | Alibaba | 35.871 | $0.82/M | 156 tok/s | 262K | 2 | Compare | |
GLM-4.7 OSS8 providers | Z AI | 34.163 | $0.74/M | 883 tok/s | 205K | 8 | Compare | |
DeepSeek V3.1 Terminus (Reasoning) OSS | DeepSeek | 33.93 | $0.45/M | 29 tok/s | 131K | 1 | Compare | |
Hy3-preview OSS2 providers | Tencent | 33.665 | $0.11/M | 93 tok/s | 328K | 2 | Compare | |
gpt-oss-120b (high) OSS20 providers | OpenAI | 33.266 | $0.08/M | 1801 tok/s | 131K | 20 | Compare | |
DeepSeek V3.2 Exp (Reasoning) OSS | DeepSeek | 32.945 | $0.30/M | 28 tok/s | 164K | 1 | Compare | |
GLM-4.6 (Reasoning) OSS2 providers | Z AI | 32.514 | $0.76/M | 62 tok/s | 205K | 2 | Compare | |
Qwen3.5 9B (Reasoning) OSS | Alibaba | 32.426 | $0.11/M | 49 tok/s | 262K | 1 | Compare | |
Gemma 4 31B OSS5 providers | 32.291 | $0.19/M | 116 tok/s | 262K | 5 | Compare | ||
DeepSeek V3.2 OSS12 providers | DeepSeek | 32.085 | $0.29/M | 276 tok/s | 164K | 12 | Compare | |
Trinity Large Thinking OSS2 providers | Arcee AI | 31.871 | $0.38/M | 212 tok/s | 262K | 2 | Compare | |
Qwen3.6 35B A3B OSS5 providers | Alibaba | 31.527 | $0.39/M | 367 tok/s | 262K | 5 | Compare | |
Qwen3 Max OSS2 providers | Alibaba | 31.377 | $2.40/M | 53 tok/s | 262K | 2 | Compare | |
Gemma 4 26B A4B (Reasoning) OSS7 providers | 31.215 | $0.00/M | 179 tok/s | 1049K | 7 | Compare | ||
Kimi K2 0905 OSS | Kimi | 30.854 | $1.07/M | 21 tok/s | 262K | 1 | Compare | |
Qwen3.5 35B A3B OSS2 providers | Alibaba | 30.685 | $0.39/M | 143 tok/s | 262K | 2 | Compare | |
GLM-4.6 OSS | Z AI | 30.242 | $1.00/M | 59 tok/s | 205K | 1 | Compare | |
GLM-4.7-Flash (Reasoning) OSS3 providers | Z AI | 30.145 | $0.15/M | 262 tok/s | 203K | 3 | Compare | |
Qwen3 235B A22B 2507 (Reasoning) OSS6 providers | Alibaba | 29.544 | $0.10/M | 156 tok/s | 262K | 6 | Compare | |
DeepSeek V3.1 Terminus OSS2 providers | DeepSeek | 28.52 | $0.35/M | 29 tok/s | 164K | 2 | Compare | |
DeepSeek V3.2 Exp OSS | DeepSeek | 28.437 | $0.30/M | 28 tok/s | 164K | 1 | Compare | |
Qwen3 Coder Next OSS3 providers | Alibaba | 28.277 | $0.31/M | 200 tok/s | 262K | 3 | Compare | |
DeepSeek V3.1 OSS7 providers | DeepSeek | 28.126 | $0.35/M | 270 tok/s | 164K | 7 | Compare | |
Mistral Small 4 (Reasoning) OSS | Mistral | 27.803 | $0.26/M | 167 tok/s | 256K | 1 | Compare | |
DeepSeek V3.1 (Reasoning) OSS4 providers | DeepSeek | 27.711 | $0.45/M | 263 tok/s | 164K | 4 | Compare | |
Qwen3 VL 235B A22B (Reasoning) OSS2 providers | Alibaba | 27.642 | $1.72/M | 44 tok/s | 131K | 2 | Compare | |
Magistral Medium 1.2 OSS | Mistral | 27.104 | $2.75/M | 40 tok/s | 131K | 1 | Compare | |
Gemma 4 26B A4B OSS7 providers | 27.086 | $0.14/M | 231 tok/s | 1049K | 7 | Compare | ||
Qwen3.5 4B (Reasoning) OSS | Alibaba | 27.076 | $0.06/M | 191 tok/s | 262K | 1 | Compare | |
DeepSeek R1 0528 OSS4 providers | DeepSeek | 27.071 | $0.91/M | 161 tok/s | 164K | 4 | Compare | |
Qwen3 Next 80B A3B (Reasoning) OSS5 providers | Alibaba | 26.723 | $0.41/M | 335 tok/s | 262K | 5 | Compare | |
GLM-4.5 (Reasoning) OSS | Z AI | 26.419 | $1.00/M | 51 tok/s | 131K | 1 | Compare | |
Kimi K2 OSS2 providers | Kimi | 26.321 | $1.00/M | 47 tok/s | 131K | 2 | Compare | |
Qwen3.5 Omni Flash OSS | Alibaba | 25.869 | $0.28/M | 232 tok/s | 256K | 1 | Compare | |
Seed-OSS-36B-Instruct OSS | ByteDance Seed | 25.157 | $0.30/M | 40 tok/s | 262K | 1 | Compare | |
Qwen3 235B A22B 2507 Instruct OSS11 providers | Alibaba | 24.961 | $0.08/M | 127 tok/s | 262K | 11 | Compare | |
Qwen3 Coder 480B A35B Instruct OSS8 providers | Alibaba | 24.771 | $0.47/M | 192 tok/s | 262K | 8 | Compare | |
Qwen3 VL 32B (Reasoning) OSS | Alibaba | 24.715 | $2.63/M | 97 tok/s | 131K | 1 | Compare | |
gpt-oss-20B (high) OSS11 providers | OpenAI | 24.47 | $0.06/M | 939 tok/s | 131K | 11 | Compare | |
gpt-oss-120b (low) OSS18 providers | OpenAI | 24.467 | $0.10/M | 1867 tok/s | 131K | 18 | Compare | |
MiniMax M1 80k OSS | MiniMax | 24.433 | $0.96/M | 95 tok/s | 1000K | 1 | Compare | |
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) OSS2 providers | NVIDIA | 24.272 | $0.09/M | 305 tok/s | 1000K | 2 | Compare | |
LongCat Flash Lite OSS | LongCat | 23.932 | $0.00/M | 103 tok/s | 256K | 1 | Compare | |
GLM-4.6V (Reasoning) OSS | Z AI | 23.423 | $0.45/M | 42 tok/s | 131K | 1 | Compare | |
GLM-4.5-Air OSS | Z AI | 23.166 | $0.32/M | 60 tok/s | 98K | 1 | Compare | |
Mistral Large 3 OSS3 providers | Mistral | 22.805 | $0.75/M | 186 tok/s | 262K | 3 | Compare | |
Qwen3.5 4B OSS | Alibaba | 22.601 | $0.06/M | 173 tok/s | 262K | 1 | Compare | |
Qwen3 30B A3B 2507 (Reasoning) OSS | Alibaba | 22.413 | $0.75/M | 146 tok/s | 262K | 1 | Compare | |
DeepSeek V3 0324 OSS4 providers | DeepSeek | 22.279 | $0.34/M | 2624 tok/s | 164K | 4 | Compare | |
GLM-4.7-Flash OSS2 providers | Z AI | 22.069 | $0.15/M | 269 tok/s | 200K | 2 | Compare | |
Devstral 2 OSS | Mistral | 22.042 | $0.00/M | 59 tok/s | 262K | 1 | Compare | |
Nemotron 3 Nano Omni 30B A3B Reasoning OSS | NVIDIA | 21.425 | $0.10/M | 296 tok/s | 66K | 1 | Compare | |
Mistral Medium 3.1 OSS | Mistral | 21.255 | $0.80/M | 62 tok/s | 131K | 1 | Compare | |
gpt-oss-20B (low) OSS10 providers | OpenAI | 20.785 | $0.07/M | 907 tok/s | 131K | 10 | Compare | |
Qwen3 VL 235B A22B Instruct OSS3 providers | Alibaba | 20.751 | $0.60/M | 52 tok/s | 131K | 3 | Compare | |
Qwen3 Next 80B A3B Instruct OSS6 providers | Alibaba | 20.106 | $0.34/M | 378 tok/s | 262K | 6 | Compare |
If you are choosing between frontier models, sort by Quality Index first. Then apply filters for cost, output speed, or context length depending on whether your bottleneck is budget, latency, or document size.
This leaderboard helps you shortlist models quickly. For side-by-side decisions, move into Compare or use the LLM Selector to match models to a use case.
GPT-5.5 (xhigh) leads the overall quality ranking right now. The best model for you depends on your use case — coding, cost, speed, and context length all shift the answer.
Sort by Quality Index for overall strength, then filter by price, speed, or context window to match your constraints. Move to the Compare page to put 2–4 finalists head to head.
Open-weight models like DeepSeek, Qwen, and Llama consistently lead on quality-to-price. Use the price axis in the scatter chart to see which models deliver the most quality per dollar.
Data is pulled from Artificial Analysis and refreshed automatically. New models appear as soon as they have benchmark scores and provider endpoints.
Model rankings
Browse the latest ranking pages for overall models, coding, open source, Ollama, long context, and agentic workflows.
Current coding leaderboard using LiveCodeBench, Terminal-Bench, and SciCode.
Top open-weight models for self-hosting, Ollama, and low-cost API use.
Best local AI models by hardware tier for self-hosting on Macs, RTX GPUs, and workstations.
Ollama-first picks for coding, chat, reasoning, and low-friction local inference.
Best long-context models for large documents, codebases, and retrieval-heavy workflows.
Rankings for tool use, multi-step execution, and autonomous agent workflows.