⚡ Loading model data...

Best Agentic AI Models

Q: What are the best AI models in 2026?

As of now, GPT-5.6 Sol leads the overall quality ranking. But the best model depends on your use case — coding, cost efficiency, speed, and context length all shift the answer.

Q: How should I compare AI models?

Start with overall quality, then narrow by what matters for your workload: cost, speed, context window, or a specific capability like coding or tool use.

Q: What is the best cheap AI model?

gpt-oss-120b (high) offers one of the strongest quality-to-cost ratios right now when evaluated against blended token pricing.

Q: What is the fastest AI model in 2026?

Gemma 4 31B currently leads on output speed. Speed matters most for real-time applications and agentic workflows with many sequential tool calls.

Q: Which AI model has the largest context window?

Grok 4.20 0309 v2 (Reasoning) has the largest context window in the ranking. For a dedicated comparison, see the Largest Context Window LLM page on WhatLLM.

The top 30 models ranked for tool use and agents. Updated live from Artificial Analysis data. Last updated: April 2026

Want the full deep dive? See the dedicated agentic ai models guide.

⭐All 💻Coding 🔓Open Source 🖥️Local / Self-Host 🦙Ollama 📄Long Context 🤖Agentic Explore all 327+ models →

Top overall

GPT-5.6 Sol

QI 58.9

Best value

gpt-oss-120b (high)

$0.07/M

Fastest

Gemma 4 31B

2126 tok/s

Largest context

Grok 4.20 0309 v2 (Reasoning)

2.0M

#	Model	Quality	Price/M	Speed	Context
1	GPT-5.6 SolOpenAI	58.9	$11.3	80 tok/s	1.0M
2	GPT-5.6 Sol (xhigh)OpenAI	57.7	$11.3	77 tok/s	1.0M
3	GPT-5.6 Sol (high)OpenAI	55.9	$11.3	71 tok/s	1.0M
4	GPT-5.6 Sol (medium)OpenAI	53.6	$11.3	70 tok/s	1.0M
5	GPT-5.6 Sol (low)OpenAI	49.4	$11.3	74 tok/s	1.0M
6	Claude Fable 5Anthropic	59.9	$20.0	71 tok/s	1.0M
7	GPT-5.6 TerraOpenAI	55	$5.6	163 tok/s	1.0M
8	GPT-5.6 Terra (xhigh)OpenAI	51.6	$5.6	134 tok/s	1.0M
9	GPT-5.6 Terra (high)OpenAI	49	$5.6	135 tok/s	1.0M
10	GPT-5.6 Terra (medium)OpenAI	45.6	$5.6	134 tok/s	1.0M
11	GPT-5.6 Terra (low)OpenAI	40.5	$5.6	128 tok/s	1.0M
12	Claude Opus 4.8Anthropic	55.7	$10.0	71 tok/s	1.0M
13	Claude Sonnet 5Anthropic	53.4	$4.0	100 tok/s	1.0M
14	Grok 4.5 (high)SpaceXAI	53.8	$3.0	121 tok/s	500K
15	GPT-5.6 LunaOpenAI	51.2	$2.3	240 tok/s	1.0M
16	GPT-5.6 Luna (xhigh)OpenAI	49.1	$2.3	222 tok/s	1.0M
17	GPT-5.6 Luna (high)OpenAI	46.1	$2.3	213 tok/s	1.0M
18	GPT-5.6 Luna (medium)OpenAI	38.1	$2.3	203 tok/s	1.0M
19	GPT-5.6 Luna (low)OpenAI	33.3	$2.3	215 tok/s	1.0M
20	Gemini 3.5 Flash (high)Google	50.2	$3.4	240 tok/s	1.0M
21	Gemini 3.5 Flash (medium)Google	45.4	$3.4	234 tok/s	1.0M
22	Gemini 3.5 FlashGoogle	34.9	$3.4	225 tok/s	1.0M
23	GPT-5.5 (xhigh)OpenAI	54.8	$11.3	98 tok/s	1.1M
24	GPT-5.5 (high)OpenAI	53.1	$11.3	95 tok/s	1.1M
25	GPT-5.5 (medium)OpenAI	50.4	$11.3	83 tok/s	1.1M
26	GPT-5.5 (low)OpenAI	43.5	$11.3	92 tok/s	1.1M
27	GPT-5.5OpenAI	35.4	$11.3	88 tok/s	1.1M
28	Claude Opus 4.7Anthropic	53.5	$10.0	71 tok/s	1.0M
29	GPT-5.3 Codex (xhigh)OpenAI	44.3	$4.8	105 tok/s	400K
30	Claude Opus 4.6Anthropic	43.7	$10.0	57 tok/s	1.0M

Showing top 30 of 90 ranked models

View all in Explore →

Rankings by use case

Each guide goes deeper than the quick filters, with methodology, benchmarks, and picks per scenario.

💻Coding 🔓Open Source 🖥️Local / Self-Host 🦙Ollama 📄Long Context 🤖Agentic

Model rankings

Current live rankings

Browse the latest ranking pages for overall models, coding, open source, Ollama, long context, and agentic workflows.

Coding

Best LLM for Coding

Current coding leaderboard using LiveCodeBench, Terminal-Bench, and SciCode.

Open page →

Open Source

Best Open Source LLM

Top open-weight models for self-hosting, Ollama, and low-cost API use.

Open page →

Self-Hosted

Best Local LLM

Best local AI models by hardware tier for self-hosting on Macs, RTX GPUs, and workstations.

Open page →

Ollama

Best Ollama Models

Ollama-first picks for coding, chat, reasoning, and low-friction local inference.

Open page →

Long Context

Largest Context Window LLM

Best long-context models for large documents, codebases, and retrieval-heavy workflows.

Open page →

Agents

Best Agentic Models

Rankings for tool use, multi-step execution, and autonomous agent workflows.

Open page →

How we rank models

Every model is scored using the Artificial Analysis Intelligence Index — a composite of GPQA Diamond, AIME 2025, LiveCodeBench, MMLU-Pro, and other benchmarks, weighted into a single 0-100 quality score. Speed, price, and context window are tracked live across providers.

The overall ranking is a starting point. For production decisions, narrow by use case using the category pages above, then compare finalists head-to-head on Compare.

Frequently Asked Questions

What are the best AI models in 2026?

GPT-5.6 Sol leads on overall quality right now, but the best model depends on your priorities. Coding, cost, speed, and context length all shift the answer. Use the category rankings above to find the right fit.

What is the best cheap AI model?

gpt-oss-120b (high) currently offers one of the best quality-to-cost ratios. Open-source models on providers like Groq or Together can be even cheaper at strong quality levels.

How should I compare AI models?

Start with overall quality index, then narrow by what matters for your workload: cost per million tokens, output speed, context window, or a specific capability like coding or tool use. Use our Compare tool to put finalists head to head.

What is the fastest AI model?

Gemma 4 31B leads on output speed right now at 2126 tokens/second. Speed matters most for real-time applications and agentic workflows with many sequential steps.

Which AI model has the largest context window?

Grok 4.20 0309 v2 (Reasoning) has the biggest context window in this ranking at 2.0M. For a dedicated long-context comparison, see our largest context window page.

How often is this ranking updated?

Data is pulled from Artificial Analysis and refreshed automatically. New models appear as soon as they have benchmark scores and provider endpoints. The ranking reflects the live state of the leaderboard.